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Abstract. We consider the random 2-satisfiabiUty problem, in which each instance is a formula 
that is the conjunction of m clauses of the form x V y, chosen uniformly at random from among 
all 2-clauses on n Boolean variables and their negations. As m and n tend to infinity in the 
ratio m/n — > a, the problem is known to have a phase transition at Oc = 1, below which the 
probability that the formula is satisfiable tends to one and above which it tends to zero. We 
determine the finite-size scaling about this transition, namely the scaling of the maximal window 
W{n, 6) = (a_(n, 6), a+(n, 6)) such that the probability of satisfiability is greater than 1 — 6 for 
a < a_ and is less than 5 for a > a+. We show that 

W{n, 5) = il- G(n-i/3)^ ^ ^ Q{n~^/^)), 

where the constants implicit in Q depend on 6. We also determine the rates at which the 
probability of satisfiability approaches one and zero at the boundaries of the window. Namely, 
for m = (1 + e)n, where e may depend on n as long as |e| is sufficiently small and |e|n"^/^ is 
sufficiently large, we show that the probability of satisfiability decays like exp (—0 (ne^)) above 
the window, and goes to one like 1 — (n~^|e|~'^) below the window. We prove these results 
by defining an order parameter for the transition and establishing its scaling behavior in n both 
inside and outside the window. Using this order parameter, we prove that the 2-SAT phase 
transition is continuous with an order parameter critical exponent of 1. We also determine the 
values of two other critical exponents, showing that the exponents of 2-SAT are identical to 
those of the random graph. 



Keywords: 2-SAT, satisfiability, constraint satisfaction problem, phase transition, finite-size 
scaling, critical exponents, random graph, order parameter, spine, backbone. 
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1. Introduction and Statement of Results 

There has recently been interest in a new field emerging at the intersection of statistical 
physics, discrete mathematics, and theoretical computer science. The field is characterized 
by the study of phase transitions in combinatorial structures arising in problems from 
theoretical computer science. 

Perhaps the most interesting phenomena in statistical physics are phase transitions. 
These transitions occur in systems with infinitely many degrees of freedom, i.e. sys- 
tems specified by infinitely many random variables. Physically, the transitions repre- 
sent changes in the state of the system; mathematically, the transitions are manifested 
as nonanalyticities in relevant functions of an external control parameter, such as the 
temperature. In systems with a large but finite number of degrees of freedom, one can 
study the approach to nonanalytic behavior. This study is called finite-size scaling. In 
systems with continuous phase transitions characterized by critical exponents, the form 
of the finite-size scaling turns out to be related to these exponents. 

Discrete mathematics often focuses on the study of large combinatorial structures. 
Random versions of these structures (with respect to natural distributions) are discrete 
systems with large but finite numbers of degrees of freedom. In the limit of an infinite 
number of degrees of freedom, these systems can and often do undergo phase transitions. 
The study of threshold phenomena emerging in these large combinatorial structures is 
therefore analogous to finite-size scaling in statistical physics. 

The theory of complexity focuses on the difficulty of solving certain combinatorial 
problems which arise naturally in theoretical computer science. The complexity of a 
given problem is determined by the difficulty of solving any instance of the problem (i.e., 
in the worst case). Researchers have also studied randomly chosen instances of certain 
problems, and determined average- or typical-case complexity. However, even when it 
is determined that a problem is easy or hard on average, it is not clear what properties 
characterize the hard instances. 

The convergence of these three disciphnes is a consequence of the recent observation that 
one can define control parameters in terms of which certain theoretical computer science 
problems undergo phase transitions, and the even more interesting observation that the 
hardest instances of these problems seem to be concentrated at the phase transition point. 
The problem for which this phenomenon has been studied most extensively is the /c-satis- 
fiability problem. Our work is the first complete, rigorous analysis of finite-size scaling 
for a satisfiability problem. 

The /c-satisfiability (/c-SAT) problem is a canonical constraint satisfaction problem in 
theoretical computer science. Instances of the problem are formulae in conjunctive normal 
form: a A;-SAT formula is a conjunction of m clauses, each of which is a disjunction of 
length k. The k elements of each clause are chosen from among n Boolean variables and 
their negations. Given a formula, the decision version of the problem is whether there 
exists an assignment of the n variables satisfying the formula. 
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It is known that the A;-satisfiabihty problem behaves very differently for k = 2 and 
k > 3 ||Uoo71|| . For k = 2, the problem is in P [ poo71|| ; indeed, it can be solved by a 
linear time algorithm | AP'1'79|| . For k > 3, the problem is NP-complete ||(Joo71 |, so that 



in the worst case it is difficult to determine whether a fc-SAT formula is satisfiable or not 
— assuming P 7^ NP. Note, however, that even for k = 2, variants of the fc-SAT problem 
are difficult. For example, the MAX-2-SAT problem, in which one determines whether 
the maximum number of satisfiable clauses in a 2-SAT formula is bounded by a given 
integer, is an NP-complete problem ||GJS76|| (see also ||GJ79|| ), and even approximating it 



to within a factor of 4/3 — e is NP-hard |[Has97| . 

More recently, it has been realized that — rather than focusing on worst-case instances — 
it is often useful to study typical instances of the fixed-fc problem as a function of the 
parameter a = m/n. Consider the random /c-SAT problem, in which formulae are gen- 
erated by choosing uniformly at random from among all possible clauses. As m and n 
tend to infinity with limiting ratio m/n — > a, considerable empirical evidence suggests 
that the random fc-SAT problem undergoes a phase transition at some value adk) of 
the parameter a ( ||MSL92|| , ||CA93|| , [[L'l'93|| , |[KS94|| ): For a < Oc, a random formula is 
satisfiable with probability tending to one as m and n tend to infinity in the fixed ratio 
a = m/n, while if a > ac, a random formula is unsatisfiable with probability tending to 
one as m and n tend to infinity, again with m/n — *■ a. 

Existence of the phase transition is on a different footing for = 2 and k > 3. For k = 2, 
it was shown by Goerdt ( [poe92|| , ||Goe96|| ), Chvatal and Reed ||CR92|| , and Fernandez de 
la Vega |Jl^'er92|| that a transition occurs at ac{2) = 1. For A; > 3, it may not be possible to 



locate the exact value of the transition point. However, there has been considerable work 
bounding the value of the presumed 3- SAT threshold from below and above. Using a suc- 
cession of increasingly sophisticated and clever algorithms for finding SAT solutions with 
high probability, lower bounds on ac(3) were improved from 1 ( ||CF86|| , ||CF90|| , ||CR92|| ) 



to 1.63 [IBFU93II to 3.003 [[FS9^ to 3.145 [|AchOO|| to 3.26 |ASOO| . Bounding the prob- 



ability of finding a solution by the expected number of solutions gave an upper bound 
on ac(3) of 5.191 |[FP83|| ; increasingly sophisticated counting arguments gave a succes- 
sion of improved upper bounds on ac{3) from 5.08 [|EF95|| to 4.758 |[KMPS95| to 4.643 
IIDB971I to 4.602 ||KKK96|| to 4.596 ||JS VOO|| . More recently a bound of 4.506 |PBM99|| has 
been announced. Although these bounds are relatively tight, they nevertheless allow for 
the possibility of a non-sharp transition. However, motivated by the empirical evidence, 
Friedgut and later Bourgain showed that indeed there is a sharp transition ||FB99|| (al- 
though they did not prove that the probability of satisfiability approaches a limit). These 
proofs were based on a general argument which shows that global, as opposed to local, 
phenomena lead to sharp transitions. However, the existence of a limiting threshold is 
still an open problem. 

Having established the sharpness of the transition, the next step is to analyze some of 
its properties. Finite-size scaling is the study of changes in the transition behavior due 
to finite-size effects, in particular, broadening of the transition region for finite n. To be 
precise, for < 5 < 1, let a_(n, 5) be the supremum over a such that for m = an, the 
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probability of a random formula being satisfiable is at least 1 — 6. Similarly, let a+{n, 6) 
be the infimum over a such that for m = an, the probability of a random formula being 
satisfiable is at most 6. Then, for a within the scaling window 



W{n,6) = {a-{n,6),a+{n,6)), 



XI) 



the probability of a random formula being satisfiable is between 6 and 1 — 6. Since, by 



FB99[, for all 6, \a+{n,6) — a_{n,6)\ — >• as tt, — »• cxd, we say that the scaling window 
represents the broadening of the transition due to finite-size effects. Sometimes we shall 
omit the exphcit 6 dependence of a±{n, 6) and W{n, 6), writing instead a±{n) and W{n). 
In these cases, the power laws we quote will be uniform in 6, but the implicit constants 
may depend on 6. 

The first model for which such broadening was established rigorously is the random 
graph model. The phase transition for this model, namely the sudden emergence of a 
giant component, was already proved by Erdos and Renyi (| |li]K60| |, | pi]K61| ]). But the 
characteristic width of the transition was (correctly) investigated only 24 years later by 
Bollobas [ Bol84|| (see also |Bol85 and the references therein). In particular, this work 
showed that the width of the scaling window W{n) is 7^-1/3+0(1)- precise growth rate 
was later shown to be 6(n~^/^) by Luczak |[Luc90|| . Many additional properties of the 
phase transition were then determined using generating functions |PjPW94|| ||JKLP9^ . 
For the finite-dimensional analogue of the random graph problem, namely percolation on 
a low-dimensional hypercubic lattice, the broadening was established by Borgs, Chayes, 

BCKS98a||, BCKS98'E[] ), who also related the power law form of 



Kesten and Spencer 

a±{n) to the critical exponents of the percolation model. 

The question of finite-size scaling in the /c-SAT model was first addressed by Kirkpatrick 
and Selman [ KS94|| , who presented both a heuristic framework and empirical evidence for 
analysis of the problem. There has also been subsequent empirical ( ||SK96| , | |1V1ZKS'1' 



and theoretical ([ [MZ9(j| ], | |1V1Z97| |, [ [MZKS'1'5S| ]) work, the latter using the replica method 



familiar from the study of disordered, frustrated models in condensed matter physics (see 
1V1PV87|] and references therein). Although the theoretical work has yielded a good deal of 



insight, the empirical work on finite-size scaling has been misleading ||WilOO|| , and rigorous 
progress on finite-size scaling in fc-SAT has been quite limited. 

In this work, we address the question of finite-size scaling in the 2-SAT problem; in 
particular, we obtain the power law form of the scaling window W{n) = (a_(n), a+(?T,)), 
together with the rates of convergence at the boundaries of the window. Previous work 
on 2-SAT by Goerdt |Poe99|| has shown that a;_(n) > 1 — 0(l/v^logn), while Verhoeven 
Ver99|| has recently obtained the result a+(n) < l+0{n^^^^). Numerical work on the scal- 
ing window for 2-SAT is somewhat controversial: While earlier simulations [MZKS'i'9y] 
suggested that the window scales like W{n) = {1 — Q(n~^^'^'^), 1 -|- 6(?t,~-'^/^''^)), recent sim- 



ulations by Wilson ||Wil9^| indicate that the 2-SAT formulae considered in |[MZKS'1'99 
are not long enough to reach the asymptotic regime.^ Indeed, we shall prove in this paper 

lAs usual, / = 6(5) means that there exist positive, finite constants ci and C2 such that ci < f / g < C2. 
Unless noted otherwise, these constants are universal. In fact, in the above formulae for W{n), the 
constants depend on S. 
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that W{n) = (1 — Q(n + Q{n ^^^)), as conjectured earlier by BoUobas, Borgs, 

Chayes and Kim |PBCK98| and predicted numerically in [|Wil98|| . We also show how the 



probability of satisfiability tends to 1 and at the edges of the window. 

In order to state our results precisely, we need a little notation. Let Xi, . . . ,Xn denote 
n Boolean variables. Writing x for the negation of x, our n variables give 2n literals 
Xi, . . . , Xn, xi, . . . , Xn- Two literals x and y are said to be strictly distinct if neither x = y 
nor X = y. A k-clause is a disjunction C = uiV ■ ■ - V Uk of k strictly distinct literals, and a 
fc-SAT formula is a conjunction F = Ci A ■ ■ • A Cm of /c-clauses Ci, . . . , Cm- We say that 
if is a subformula of F if it can be obtained from F by deleting some of its clauses. A 
fc-SAT formula F = F{xi, . . . , a;„) is said to be satisfiable, or SAT, if there exists a truth 
assignment rji G {0, 1}, z = 1, . . . , n, such that F{r]i, . . . , ?7„) = 1. Here, as usual, stands 
for the logical value FALSE, and 1 is the logical value TRUE. We write "F is SAT" if 
the formula F is satisfiable, and "F is UNSAT" if the formula F is not satisfiable. We 
also sometimes use the alternative notation SAT(F) and UNSAT (F) to denote these two 
cases. 

We consider the random 2-SAT problem in two essentially equivalent forms, given by a 
priori different probability distributions of random 2-SAT formulae on xi, . . . , Xn- First, 
we consider the probability space of formulae Fn^m chosen uniformly at random from all 
2-SAT formulae with exactly m different clauses. (Here x\/y is considered to be the same 
as i/Vx, but different from e.g. xMy.) Second, we consider the space of formulae F„^p with 
2-clauses on , . . . , chosen independently with probability p. In this introduction, we 
shall state theorems in terms of the Fn^m'-, the equivalent theorems for the F„^j, will be 
given in Section |^. The conversion between the two formulations of the problem is given 
in Appendix A. In both cases, we use ¥{A) to denote the probability of an event A. 

As usual in 2-SAT, it is convenient to study the phase transition in terms of the pa- 
rameter e representing the deviation of a from its critical value: 

m = {l + e)n. (1.2) 

When studying finite-size effects, we shall take the parameter e to depend on n. Our 
analysis shows that the appropriate scaling of e is n~^/^, so that it is natural to define yet 
another parameter A = A„ according to 

e = \nn-^'\ (1.3) 

and distinguish the cases An bounded, A„ oo and A„ — oo. 
Our main result is the following theorem. 

Theorem 1.1. There are constants Eq and Aq, < eo < 1; < Aq < c>o, such that 

r 1 - (^) tf -eon^/' < A„ < -Aq, 

P(F„,„ is SAT) = <^ e(l) z/ -Ao <\n< Ao, (1-4) 

[exp(-e(A3)) z/Ao< A„<£on^/3. 
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Note that the behaviors for A„ < and A„ > can be cast in the same form by writing 
P(F„,^ is SAT) = 1 - e(|A„|-3) = exp(-e(|A„|-3)). 

Theorem gives us the exact form of the scahng window: 

Corollary 1.2. For all sufficiently small 6 > 0, the scaling window is of the form 

W{n, S) = {1- 0(n-^/3), 1 + Q{n~^^^)), 

where the constants implicit in the definition of B depend on 6, and are easily calculated 
from equation ^1.4\ )- 

Of course, the theorem gives us more than the boundaries of the window; it also gives 
us the rates of approach of the probability of satisfiability at these boundaries. As an 
easy special case of the rate result at the upper boundary, note that if e is positive and 
independent of n, then our result for A > Aq gives that 

P(F„,^ is SAT) = exp{-e{e^n)). (1.5) 
This strengthens both the result of Fernandez de la Vega |[t'er98|| that P(F„^m is SAT) 



0{exp{—f{e)^/n)) and the recent improvement of Achlioptas and Molloy ||AM98|| that 
F{Fn,m is SAT) = 0(exp(-/(e)n)) for some f{e) > 0. 

We remark that in the random graph model, the existence of a complex component, i.e. 
a connected component with more than one cycle, is roughly analogous to the existence of 
a contradiction in a random 2-SAT formula. When there are m = ^n{l + Xn^^^^) edges, 
Britikov [[Bri89|] showed 



P 



1 - if -^(^) < A < -Ao, 

f^"'- contams no ) = (i + ^^(i)) J p(A) if -Ao < A < Ao, 

Vcomplex component / ^ " v / / ^ ^ / 



2i/4r(i/4) 11 Ao S A S uj[n) 



(1.6) 

where -P(A) is an explicit power series in A, and uj{n) is an unspecified slowly growing 
function of n. Later this formula was shown to be vahd for |A| < ra^/^^ ||jklP94| , pg. 289]. 
Observe that in comparison, the analogous bounds for random 2-SAT in Theorem p..l| are 
not so precise, but they hold for the full range of |A| <^ n^^^. 

The key to our analysis is the introduction of an order parameter for the 2-SAT phase 
transition. As usual in statistical physics, an order parameter is a function which vanishes 
on one side of the transition and becomes non-zero on the other side. Control of the growth 
of the order parameter was the key to Bollobas' analysis of finite-size scaling in the random 



graph |[Bol84|| , and to Borgs, Chayes, Kesten and Spencer's analysis of finite-size scaling 



in percolation ||BCKS9^ . Our order parameter for satisfiability is the average density 



of the spine of a Boolean formula, which we define as follows. Given a formula F in 
conjunctive normal form, we define the spine S{F) as the set of literals x such that there 
is a satisfiable subformula H of F for which H A x is not satisfiable, 

S{F) = {x\3H C F,H is SAT and H A x is UNSAT}. (1.7) 
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Our notion of the spine was motivated by the insightful concept of the backbone, 
B{F), introduced by Monasson and Zecchina |[MZ96|| — where the backbone density 



\B{F)\/n was originally called "the fraction of frozen variables." The backbone B{F) is 
the set of literals that are required to be FALSE in any assignment that minimizes the 
number of unsatisfied clauses in F. It is easy to see that B{F) C S{F), and in particular 
B{F) = S{F) if F is satisfiable. One of the principal differences between the spine and 
the backbone is that the spine is monotone in the sense that adding clauses to a formula 
only enlarges its spine. It is the monotonicity which enables us to achieve analytical 
control of the spine. In addition, we have found that the spine is easier to simulate than 
the backbone ||Wil98|| . We believe that the spine will become an important tool in the 



analysis of satisfiability problems. 

Consider now a satisfiable 2-SAT formula F. It is not hard to see that the addition of 
the 2-clause C = xMy makes F (or, more precisely, makes F AC) unsatisfiable if and only 
if both X and y lie in the spine. Building a random 2-SAT formula by adding clauses one 
by one at random to an initially empty (and hence satisfiable) formula, we can therefore 
control the probability that a formula is satisfiable if we have sufficient control of the 



spine in each step. This is the strategy we shall follow to prove Theorem ITTT 



In the course of proving Theorem |1 . 1| , we obtain detailed estimates on the expectation 
and variance of the size of the spine inside the scaling window m E [n — Q{n^^^),n + 
B(n^/^)], i.e. the finite-size scaling of the spine. Before stating these results, however, let us 
give the behavior of the size of S'(F„ m) on the scale n. To this end, let 'd : (0, oo) (0, 1) 
be the function satisfying 

1 - ^{e) = exp[~{l + e)d{e)], (1.8) 

i.e. 

^(e) = l-^__(l + £)'=-ie-(^+^)^ (1.9) 

k=l 

The kth term in ( |1.9| ) is the probability that a Poisson birth-and-death process with birth 
rate 1 + e will have size k, while '(^(e) is the probability that it is infinite. Note that 
^9(e) = 26 + 0(5^) for positive e sufficiently small. The size of the spine is given by: 



Theorem 1.3. For any fixed e G {—Sq^Sq), where Eq is the constant from Theorem 1.1 , 
we have 

'e(£-2) z/£<0 
^(|5(F„,„)|) = <[ 6(^2/3) z/£ = (1.10) 

2n^{£) + o{n) ife>0. 



The behavior above, coupled with the role of the spine in the proof of Theorem |0 
justifies our identification of the density of the spine as an order parameter for the 2- 
SAT transition. In the language of phase transitions. Theorem |T]^ implies that the 
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2-SAT transition is second-order (or continuous), with order parameter critical expo- 
nent (3=1. Here, as usual, we say that the order parameter has critical exponent jS if 
lim„^oo E{\S{Fn,rn)\)/n = 9(e'^) as e I 0, see discussion following Remark P3| . 

The next theorem states our results for the finite-size scaling of the spine S{Fn,m)- 

Theorem 1.4. Let Eq and Xq be the constants in Theorem |IT1|. Suppose |A„| < Equ^^^. 
Then 

riA>2/3(i + o(l)) z/A„<-Ao 
= <^ 6(^2/3) ^f |A„| < Ao (1.11) 

[4A„n2/3(i + o(l)) z/A„> Ao, 

where the o(l) terms represent errors which go to zero as |A„| — > oo and e = XnU^^^^ 0. 



Remark 1.5. In the course of proving Theorems p..l| , |1.3| and |1.4| , we shall prove bounds on 
the variance of \S{Fn^m) \ which allow us to generalize the above statements in expectation 
to statements in probability. 

Statistical mechanical models with second-order (i.e., continuous) transitions are often 
characterized by critical exponents which describe the behavior of fundamental quantities 
at or approaching the critical point. It turns out (see [pCKS98b|| and announcements 
Cha98|| and ||CPS99|| ) that it is possible to read off some of these exponents from the 



m 



finite-size scaling form of the order parameter and the scaling window. In particular, the 
scaling of the order parameter at the critical point allows us to evaluate the so-called field 
exponent 5 as 

E(|5(F„,J|) = e(nTT^) if |A„|<Ao. (1.12) 



Similarly (again using | |B(JKS98"5| , | |(Jha98[ and | |(JPS99| ]), the scaling of the window allows 



us to identify the exponent sum 2(3 + 7, according to 

W{n,5) = (l-e(n-i/(2/5+^)),l + 0(n-i/(2/3+7)))^ (I.I3) 

where (3 is the order parameter exponent described above, i.e. 

lim -^(|S(F„,J|) = e(£0 as ejO, (1.14) 

n—*oo n 



and 7 is the so-called susceptibility exponent. Comparing equations ( |1.14| ), ( |1.13| ), and 



( |1.12| ) to Theorem |1.3| , Corollary |1.2| , and Theorem |1.4| , respectively, we get the following. 

Corollary 1.6. The 2-SAT transition is a second-order (i.e. continuous) transition with 
critical exponents: 

(3 = 1, 7 = 1, and S = 2. 

Thus we have proved that the critical exponents of the random 2-SAT problem are iden- 
tical to those of the random graph. See ||BBCKWOO|| for a more detailed discussion of the 
critical exponents for 2-SAT. 
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The organization of this paper is as follows. In Section we discuss the well-known 
representation of 2-SAT formulae as directed graphs, a representation we use extensively 
in our proofs. In that section, we also derive new results on various representations of 
the spine in terms of directed graphs. While most of the results in Section Q concern 
given formulae, not distributions of formulae, a final result there gives a mapping of 
a distribution of certain sets in the graphical representation of random 2-SAT into the 
standard random graph model. In Section |^, we state our main technical estimates on the 



expectation and variance of the size of the spine, and formulate an analogue of Theorem ^ 
for the distribution F^^p- We then outline the strategy of our proof, giving first our 
heuristic for the expected size of the spine, and then showing how this will be used to 
obtain the size of the scaling window (Theorem |1.1|) . While the width of the scaling 
window can be determined from the spine expectation and variance estimates alone, the 
rate of approach from above in Theorem |1.1| requires that a sufficiently large spine forms 
with extremely high probability. In order to prove this, in Section ^, we define structures 
we call "hourglasses" which are basically precursors to the spine, and we state a theorem 
giving conditions under which a giant hourglass forms. The proof of the hourglass theorem 
is given in Section O. In Section |, we use the expectation and variance results on the spine. 



and the hourglass theorem, to establish the analogue of Theorem |1.1| for the distribution 
Fn,p- In Sections ^ and |^, we develop some machinery from random graph theory and 
derive moment bounds, which enable us to prove the expected size and variance results 
for the spine in Sections |^ and ^, respectively. Appendix A contains the conversion from 
our results on F„_p to F„_m, and Appendix B establishes a technical result on the cluster 
size distribution in the random graph problem. 



2. The Digraph Representation of 2-SAT 

In the digraph representation, each 2-SAT formula corresponds to a certain directed 
graph (or digraph) Dp. To motivate the mapping of F into Dp., note that F is satisfiable 
if and only if all clauses in F are satisfiable. Thus, if F contains a clause C = x \/ y, 
a satisfying truth assignment with x set to FALSE requires that y is set to TRUE, and 
a satisfying assignment with y set to FALSE requires that x is set to TRUE. So the 
clause X y y corresponds to the logical implications x = TRUE ^> y = TRUE and 
y = TRUE =^ X = TRUE. We shall encode this fact in the digraph Dp hj including 
the edges x ^ y and y x in Dp iS F contains a clause C = x\J y. 

To be precise, given a 2-SAT formula F, define the digraph Dp a.s the directed graph 
with vertex setQ 

[n] = {xi, . . . ,a;„,xi, . . . (2.1) 

and edge set 

Ep = {x ^ y \ (x V y) is a clause in F}. (2.2) 



^Note that we deviate from the standard notation, where [n] stands for the set {1, 2, . . . , n}. 
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Since [x V y) and {y V x) are considered to be the same clause, the digraph Dp contains 
the edge x ^ y if and only if it contains the edge y ^ x. As usual, an oriented path in 
Dp is a sequence of vertices vq, vi, . . . ,Vk E [n] and edges Vi — > f j+i for z = 0, 1, . . . , /c — 1. 
We say that this path is a path from x to y if Vq = x and Vk = y- We write x~^y, or 

Dp 

sometimes simply x y,ii Dp contains an oriented path from x to y. By convention, we 
shall say x-^x for all x. Finally, we say that Dp contains a contradictory cycle ii x -^x 
and X X for some x G [n] . 

The following lemma connecting the structure of the digraph Dp with the satisfiability 
of the formula F is implicit in all digraph analyses of 2-SAT, see e.g. [poe92|| . For 
completeness, we shall give an explicit proof here. 

Lemma 2.1. A 2-SAT formula F is satis fiable if and only if the digraph Dp has no 
contradictory cycle. 

Proof. Let us first assume that F is satisfiable, with satisfying assignment r^j G {0, 1}, i = 
1, . . . , n. Consider an edge x y in the corresponding digraph. Since F{t]i, . . . , rjn) = 1, 
the presence of the edge x ^ y gives the logical implication x = FALSE ^> y = TRUE. 
A contradictory cycle x x x therefore gives the logical implication x = TRUE =^ 
X = FALSE =^ X = TRUE, which is not compatible with any truth assignment for x. 

We prove the converse by induction on the number n of variables. For n = 1 there is 
nothing to prove. Turning to the induction step, suppose that the digraph Dp has no 
contradictory cycles. We claim that in this case F is satisfiable. To this end, we first 
recall the definition of strongly connected components for directed graphs. We say that 
two vertices x and y in a directed graph are strongly connected if x y x, i.e. if the 
directed graph Dp has a cycle containing x and y. The strongly connected component of 
a vertex x is the induced subgraph of Dp containing the set of vertices 

Cs{x) = I a; -w y -w x}. (2.3) 

Somewhat loosely, we call Cs{x) the strong component of x. Clearly, the strong compo- 
nent partitions the vertex set [n]. We define a partial order < on the set of all strong 
components by taking Cs{x) < Csiy) if x y, and so x' y' for all x' G Cs{x) and 
y' G Cs{y). Let Cs be a minimal element in this partial order, i.e. let Cs be a strong 
component such that Dp contains no edge x y with x ^ Cs and y G Cs- For a set of 
literals M, let 

M = {y\yeM}. (2.4) 

Since Dp has no contradictory cycle, fl = 0. Furthermore, since Cs is a minimal 
element in our partial order, Cs must be a maximal element. If we set all literals in Cs 
to FALSE, and so all literals in Cs to TRUE, then all clauses in F containing at least one 
literal from Cs U Cs are TRUE. This process removes all the variables corresponding to 
literals in Cs and Cs from [n], and all clauses involving these variables from F, leading 
to a new 2-SAT formula F'. Since the graph Dpi is a subgraph of Dp, it contains 
no contradictory cycles either. Using the inductive hypothesis, we obtain a satisfying 
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assignment for F, which completes the proof of the converse and hence of the theorem. 

□ 

Remark 2.2. If F is a mixture of one and two-clauses, i.e. if it is of the form F — H A 
xi A ■ ■ ■ A Xfc where if is a 2-SAT formula and literals, wc define Dp hy 

including the edges aJj — Xj, i = 1, . . . , fc, in addition to the edges in Dh- It is not hard 
to see that the above proof applies also to this situation, giving again that F is SAT if 
and only ii Dp contains no contradictory cycles. 

While the previous lemma says that contradictions in a formula correspond to cycles in 
the digraph, the next lemma says that the spine of a formula corresponds to "half-cycles" 
in the digraph. This graphical description of the spine is central to our analysis. 

Lemma 2.3. For every 2-SAT formula F, 

S{F) = {x I x-wx}, (2.5) 

Dp 

where Dp is the digraph corresponding to F. 

Proof. Suppose that x x, and let x = vq ^ vi ^ ■ ■ ■ ^ Wr-i Vr = x he a. shortest 
directed path from x to x in Dp. Then no literal appears twice in the path, although a 
hteral and its negation may well do so. Let i be the smallest positive integer such that ve 
is not strictly distinct from all oi Vo,Vi, . . . , vi^i, and let < A; < ^ be such that vi — v^. 
Then H — (vq V t)!) A (vi V ^2) A • ■ ■ A {vi-i V V() is a subformula of F that is satisfied by 
setting each of vq, vi, . . . , vi-i to FALSE. On the other hand, H A x = H A vq is UNSAT 
since in order to satisfy it, we would have to set Vq to TRUE, then vi to TRUE, and so 
on, ending with the requirement that be set TRUE. However, as Vk is TRUE, is 
already set FALSE. This completes the proof that x x implies that x E S{F). 

Conversely, suppose that H G F is SAT and H' = H Ax = H A{x\/x) is UNSAT. Then 
Dh' has a contradictory cycle C = u u u. Since Dh does not have a contradictory 
cycle, the cycle C of H' contains the oriented edge ^ — > a;, say u-^u-^x^x-^u. But 
then in Dh we have x u -^u -^x, so x -^x. Hence if x e S{F) then x-^x. □ 

Dp 

Our next lemma gives an alternative representation for the spine of a 2-SAT formula 
F. In order to state it, we introduce the out-graph Dp{x) of a vertex x in Dp as the set 
of vertices and edges that can be reached from x. Dp{x) therefore has the vertex set 

L+{x) = = {y I x^y}, (2.6) 

Dp 

and contains all edges y ^ z in Dp such that y € Lp{x). For future reference, we also 
introduce the in-set 

L-{x)=L-p{x) = {y\y'^x} (2.7) 

Dp 

and the corresponding in-graph Dp{x). Note that x e L'^ix) since, by our convention, 
X a; for all x. 
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As we shall see, the spine of a 2-SAT formula F can equivalently be described as the 
set of literals x such that Lp{x) is not strictly distinct, where for simplicity, we say that 
a set M C [n] is strictly distinct (s.d.) if the literals in M are pairwise strictly distinct. 

Lemma 2.4. For every 2-SAT formula F 

{x \ X x} = {x \ Lp{x) is not s.d.} 

Dp 

= {x I Lp{x) \ {x,x} is not s.d.}. (2.8) 

Proof. We start with the first equality in (|2.8[). If x -w x, then {x,x} C Lp(x), so Lp{x) 
is not strictly distinct. If Lp{x) is not strictly distinct, then {y,y} C Lp{x) for some 
literal y E [n], and hence x y and x y. But x y implies that y x, which 
together with x y implies x -^x. 

To prove the second equality, we first note that the set of literals x for which Lp{x) \ 
{x,x} is not strictly distinct is obviously a subset of the set of literals x such that Lp{x) 
is not strictly distinct. We are thus left with the proof that the statement that Lp{x) is 
not strictly distinct implies the (apparently stronger) statement that Lp{x) \{x,x} is not 
strictly distinct. So let us assume that Lp{x) is not strictly distinct. By the first equality 
in ( |2.(j| ), this implies x x. Since the digraph of a 2-SAT formula does not contain any 
direct edges from x to x, we conclude that there must be a literal y strictly distinct from 
X such that x -w y -w x. The latter statement implies that both x -w y and x y, so 
that Lp{x) \ {x,x} is not strictly distinct. □ 

Remark 2.5. As the above proof shows, the first equality in ( |2.8| ) is true for mixed formulas 
of 1- and 2-SAT clauses as well. The second is obviously false for mixed formulas of 1- 
and 2-SAT clauses, as the simple example of the formula F = x shows. 

The Trimmed Out-Graph. We end this section with a construction of a trimmed 
version of the out-graph Dp{x), which we denote by Dp{x) with vertex set denoted 
by L~^{x). The utility of this trimmed graph is that, by projecting it to an unori- 
ented graph, we shall be able to relate it to the more familiar random graph. Given 
any digraph on a subset of {xi, . . . ,Xn,xi, . . . ,Xn} we can project it to an unoriented 
graph on a subset of {xi,... ,x„} by dropping negations. In particular, each literal 
X G {xi, . . . , Xn, Xi, . . . , Xn} gets mapped to its corresponding variable (x) G {xi, . . . , x„}, 
and each clause x V ?/ gets mapped to the edge {(x), {y)}- We call this the unoriented 
projection of the digraph. Specifically, for F = Fn^p, we shall compare the distribution of 
Dp{x) for a fixed vertex x to that of the connected component of a given vertex in the 
random graph G„.2p-p2, where, as usual, Gn,p denotes the random graph on {xi, . . . , x„} 
that is obtained from the complete graph on {xi, . . . , x„} by keeping each edge with prob- 
ability p. We use the symbol C„^p(x) to denote the connected component of the vertex 
(x) in Gn,p. 

Construction of the trimmed out-graph. 
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We construct the trimmed out-graph Dp{x) by doing a local search in Dp starting from 
literal x, and at the same time we construct that portion of the random graph Gn,2p-p^ 
which determines the connected component of vertex {x). Let the "current graph" be 
that subgraph of Dp which consists of the vertices and edges that have been examined 
by the local search. The "frontier" consists of those vertices of the current graph from 
which further searching may be done. Initially the current graph consists of just the 
literal x, and x is in the frontier. Eventually the frontier will be empty, terminating the 
local search, at which point Dp{x) will be defined to be the current graph. During the 
search, certain edges v ^ w will be tested to see if they are in Dp, and the search records 
whether the results arc "yes" or "no" on the corresponding unoriented edge (v) ~ (w). 
These test results will later be used to construct the random graph Gn,2p-p2. Each step 
in the local search consists of the substeps listed below. 

1. An arbitrary literal v in the frontier is selected (one choice is the lexicographically 
smallest). 

2. For each literal w such that neither w nor w is in the current graph, check if v — > ty 
is in Dp, and record either "yes" or "no" on the edge (v) ~ (w) accordingly. 

3. For each literal w for which "yes" was recorded, declare w to be a "new literal" — 
unless "yes" is recorded for both w and w, in which case we declare only one (say 
the unnegated one) of them to be a "new literal." 

4. Adjoin each new literal w and the edge v ^ w to the current graph. 

5. Adjoin each new literal w to the frontier. Remove v from the frontier. 

6. Consider each ordered pair of vertices {w, /) such that either (1) w is new and / is 
in the frontier but not new, or (2) w and / are both new, and w is lexicographically 
smaller. Test if w ^ f or f ^ w in Dp, and record either "yes" or "no" on the edge 
(w) ~ (/) accordingly. If there is one "yes," adjoin the corresponding edge to the 
current graph, if there are two "yes"'s, adjoin only one of the edges, say the edge 

f. 

Lemma 2.6. The trimmed out-graph Dp{x) defined above has the following properties. 

i) Dp{x) is a subgraph of Dp{x). 

ii) Lp{x) is strictly distinct. 

iii) Lp{x) = Lp{x) if and only if Lp{x) is strictly distinct. 

iv) For F — Fn^p, the unoriented projection of the digraph Dp{x) has the same distribu- 
tion as Cn^2p-p^{^)- In particular, \L'^^^{x)\ and |C„2p-p2(x)| are equidistributed. 

Proof. By construction, properties (i) and (ii) are obvious. Property (iii) is not much 
more difficult. There are certain possible edges leading out of the vertex set Lp{x) that 
were never tested, or that were tested and present, but then excluded from the trimmed 
out-graph Dp{x) anyway. But each such edge either led to a literal already in Lp(x), or 
else led to a literal whose complement was in Lp{x). Thus if the literal set L'^{x) were 
to contain more literals than Lp{x), then Lp{x) would not be strictly distinct. On the 
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other hand, if Lp{x) and Lp{x) are identical, then Lp{x) is trivially strictly distinct by 
property (ii). 

Property (iv) is similarly easy. First, for each literal u e [n], we define [u] = {u,u}. By 
induction we shall prove that, at the beginning and end of each step of the search, the 
following properties hold: 

1. For every pair of literals u and v of the current graph, precisely two edges between 
[u] and [v] have been tested. 

2. For every literal v in the current graph but not in the frontier, and every literal w 
such that neither w nor w is in the current graph, precisely two edges between [v] 
and [w] have been tested, both results being "no." 

3. For every literal v in the frontier, and any literal w such that neither w nor w is in 
the current graph, none of the edges between [v] and [w] have been tested. 

4. If none of u,u,v,v are in the current graph, then no edges between [u] and [v] have 
been tested. 

5. For any pair of strictly distinct literals u,v & [n], either none or precisely one of the 
four edges between [u] and [v] appears in the current graph. The latter happens if 
and only if some test between [u] and [v] was positive. 

Indeed, assume that (1) - (5) hold at the beginning of a step. To prove that (1) holds 
at the end of the step, we first note that no edge between [u] and [v] was tested in the 
current step if neither u nor v is new. If v is old and u is new, then either v was the 
selected vertex in the frontier, in which case the edges v ^ u and v ^ u have been 
tested in the current step, or v was not in the frontier, in which case precisely two edges 
between [u] and [v] were tested in a previous step (with answer "no") by the inductive 
assumption (2) . If both v and u are new, then no edge between [u] and [v] was tested in a 
previous step by the inductive assumption (3), and precisely two edges (the edges rt — > v 
and V ^ u) between [u] and [v] are tested in the current step. 

To prove (2), we note that if v is in the current graph but not in the frontier, it was in 
the frontier in some previous step, and got removed from the frontier after all edges from 
V to vertices u, with neither u nor u in the current graph at the time, were tested. This 
includes in particular the vertex w in question, and since we assume that neither w nor w 
is in the current graph, it follows that both tests must have given the result "no" at the 
time. After that step, v is not in the frontier, so no edge containing v or v is ever tested 
again, implying statement (2). 

Statement (3) follows from the observation that an edge between a vertex v in the 
current graph and a vertex w such that neither w nor w is in the current graph is only 
tested if v is the selected vertex in the current step, in which case it is not in the frontier 
after this step anymore. 

Statement (4) is obvious, since an edge / — > w is only tested if either / is in the frontier 
(and hence in the current graph before the current step), or both / and w are new vertices, 
which means they are in the current graph after steps (1) - (6). 
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To prove (5), we consider three cases. In the first case, none of the vertices u, u, v and 
V is in the current graph, in which case no edge between [u] and [v] appears in the current 
graph by the inductive assumption (4). The second case is the one in which exactly one 
of the four vertices u, u, v and v is in the current graph. Without loss of generality, let us 
assume that this is the vertex v. Then none of the edges between [u] and [v] appears in 
the current graph by (2) and (3). The third case is that precisely two of the four vertices 
u, u, V and v are in the current graph, say u and v. Then precisely two of the four edges 
between [u] and [v] have been tested by the inductive assumption (1). Since the above 
search procedure always tests two of the four edges between [u] and [v] at a given time, 
and adds one (but not both) of them precisely when at least one of them tests positive, 
we get (4). 

We now use the properties (1) - (5) above to prove statement (iv) of the lemma. If 
we pick the unordered pairs of numbers between 1 and n in some arbitrary order, each 
time randomly saying "present" (with probability 2p — p^) or "absent" (with probability 
(1 — pY)-, then even if the order in which we pick the pairs depends on the previous 
random choices of present /absent, the result will be the random graph G'n,2p-p2. This is 
in effect what the trimmed local search does, except that it stops when the connected 
component containing (x) has been determined. Thus unoriented projection of Dp{x) is 
just the connected component containing {x) in G„,2p-p2- D 

3. Strategy of the Proof 

In this section, we shall first state our principal estimates and results for the distribution 
Fn^p (to be proved in later sections), and then give the heuristics for these results. 

3.1. Main Results for the Distribution F„,p. As explained in the last section, the 



spine of a formula F consists of all literals x for which x -^x (see Lemma which in 
turn is just the set of all literals x such that Lp{x) is strictly distinct (see Lemma p.4|) . 
If F is distributed according to the model the expectation and variance of the size 
of S'(F„ p) are therefore given by the equations 

^(l^(i^n,p)|)= $^P(X - X) (3.1) 

and 



x£\n\ 



Ei\S{F^,p)\^)-E{\S{F^,p)\y = V ¥(x X andy y) - ¥(x x)¥(y 



where x ^ x & shorthand for x x. 



Fn,p 



y 

(3.2 



Dp, 



The following two theorems allow us to prove suitable bounds on the expected size and 
variance of the spine of a random 2-SAT formula, and are at the heart of our proofs. 
Before we can proceed, we unfortunately need a short interlude on Landau symbols: 
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In this paper, we shall use Landau's notation / = 0{g) and / = o{g). As usual, 
/ = 0{g) stands for a bound |/| < c\g\, where c is a universal constant, unless otherwise 
specified. If we have a bound of the form |/| < h{g)\g\, where h{g) is a function which 
is bounded above, though not necessarily uniformly, for finite g, and which is uniformly 
bounded above as g goes to zero, we shall use the notation / = Oo(5'). In this notation, 
e^^ — 1 is Oo(x^), but it is not O(x^). 

Our use of the symbol o{g) is slightly stronger than usual. Typically, / = o{g) means 
that f /g goes to zero as the independent variables in question tend to their limiting values, 
but usually / = o{g) does not require that f/g is bounded in the whole domain of the 
independent variables. We require both uniform boundedness and that f/g tends to zero. 
Since it may be ambiguous which independent variables tend to cx) or in an expression 
of the form / = o{g), we frequently specify the variables in question. Thus / = 0A,e(5') 
means that //(? — ^ as A — » oo and e — >■ 0. For example, in this notation, the o(l) terms 
in Theorem would be written as Oe_A„(l)- 

Finally, as mentioned earlier, f = Q{g) means that there exist positive, finite constants 
Ci and C2 such that Ci < f / g < C2- Unless noted otherwise, these constants are universal. 

Theorem 3.1. There are constants Aq and eo? < Aq < oo and < eo < 1; such that 
the following statements hold for 

p=^{l+e) = ^{l + \nn-'/^) (3.3) 



and Aq < |A„| < EqU^^^ . 
i) If £ < 0, then 



a) If e > 0, then 



2n ' ' 2n 



-1/3 



/ \ ft ' 



P(i x) = tf(ir)(l+Oj„(l)), 

= 2A„n-i/3(l + OA„(l)+0(£)) (3.5) 

where d{e) is defined in (|1.8| ). 



Theorem 3.2. Letp, e and A„ be as in Theorem \3. 1\ . Then the following statements hold 
for all strictly distinct literals x and y. 
i) If e < 0, then 

Pfa; ^ x)p(?/ ^ y) < pfx x and y ^ y) =o(^\^). (3.6) 
a) If e > 0, then 

< f(x ^ X andy ^ y) -f(x x)F(y y) = o(^\^). (3.7) 

V Fn,p Fn^p / \ Fn,p J V fyi.p / V Aj^ / 
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Remark 3.3. By monotonicity, the bound ( p.6|) can be extended to all G [—n^^^, — Aq] 
Indeed, using that the events x -^x and y are monotone events, we have that 

P ( X X and y ~^ y] <^(x x and y y 



provided p < po. Setting po = {1 — eo)/2n, using equation (|3^ ) to bound the right 
hand side by 0(n~'^^^ /{n^^^y) , and observing that n''^^^ /{n^^^)'^ = 0(n"^/^/A^) provided 
A„ E [—n^^^, — ^o^^^^^], we obtain that 

f(x X andy y) =o(^\^) for all A„ G -Aq]. (3.8) 



Given the above two theorems, we shall prove the following analogue of Theorem ITT 



for the ensemble F^^p. 



Theorem 3.4. There are constants Aq and Eq, Q < \q < oo and < £o < 1; such that 
the following statements hold for p = 2^(1 + ^nn^^^^) o.nd Aq < |A„| < Equ^^^ . 
i) If Xn < 0, then 



ii) If Xn > 0, then 



F{Fn,p is SAT) = exp ( - e(|A„|-=^)). (3.9) 



F{Fn,p is SAT) = exp ( - e (A^) ) . (3.10) 



For fixed A„, both ( |3.4D and ( |3.5D are of the form P(x -w x) = Q(n~^/^). Together with 
equation (P?T|), Theorem P?T| therefore implies that the expected size of the spine scales 
like n^/^, provided A„ stays bounded as n ^ oo. The heuristics for this scaling with n 
will be given in the next subsection, and the actual proof of the scaling will be given 
in Sections Theorem |3]^ allows us to control the deviations of the random variable 
\S{Fn,p)\ from its expectation; its proof will be given in Section ||. Together, these two 
theorems allow us to prove Theorem ^.4| , which is just the analogue of Theorem |TTT] in the 
model Fn^p. In the final subsection, we shall describe the strategy of this proof. While 
the actual proof is easier in the model Fn^p, the heuristic argument is easier in the model 
Fn,m- Our goal in the last subsection is therefore to describe how the scaling n^^^ for the 
size of the spine in the model Fn^m leads to bounds of the form (|3.9|) and ( |3.10|) . The 



actual proof of Theorem 3.4 is given in Section 



3.2. Heuristics for the Scaling of the Spine. The proof of Theorem ^ (and 
hence also the proof of the expected size of the spine. Theorem |L^) uses the digraph 
representation of the last section. Indeed, by Lemmas ^.3|, P^, |2.6| (iii) and p^.6| (iv), and 



the fact the probability of the event x x does not depend on the choice of the literal 
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X G [n], we have 

E{\S{F^,p)\) = 2nF{x^x) 



2nP f L+ (x) ^ L+ (x 

n 

2nJ2 [P (iLtM = k)-F {\L+„Jx)\ = k, L+^Jx) is s.d) 



k=l 
n 



2nY, [P(|a,2p-p^(x)| = /^)-P(|L+ =fc, L+^_^(x) is s.d 
fc=i 



(3.11) 



It turns out that for 2p — near to the random graph threshold 1/n, and k < 0(n^/^), 
the size of the largest component in the random graph, the probability that L^^^{x) is 
strictly distinct and has size k is well approximated by F {\Cn,2p-p'^{x)\ = k), so that the 
summand in equation (|3.11|) is approximately zero. On the other hand, for 2p — near 
1/n and k > 9(n^/^), only the sum over P (|C„_2p-p2(x)| = k) contributes to ( p.ll| ). Thus 
we can approximate 

E{\S{Fn,p)\) ^ 2nF {\Cn,2p-p<x)\ > n^/') . (3.12) 

As the reader might imagine, the above arguments require a good deal of justifica- 
tion; see Section |]-|^ for precise bounds. But for 2p — p^ near 1/n, the probability 
that |C„ 2p-p2(x)| > m?/'^ scales like the probability that x lies in the largest compo- 
nent in the random graph, which in turn scales like n~^/^ (see e.g. [ |Bol85|| ). This implies 
that the expected size of the spine S'(F„p) scales like m?^'^ provided p is of the form 
p = 2^(1 ± 0(n~^/^)). Observing that the models Fn^p and Fn,m are equivalent as long as 
m is near its expected value (/^)p (see Appendix A), we obtain the scaling of Theorem |L4. 



3.3. Heuristics for the Scaling of the Window. As explained earlier, our goal is 
to describe how the behavior E{\S{Fn^rn)\) = 0(n^'^^) leads to the bounds ( p.9|) and ( p.lO|) . 



To this end, consider a process which builds random formulas as follows: Given a 2-SAT 
formula F^, let Fm+i = F^ A C, where C = x V y is chosen uniformly at random from 
the set of all 2-clauses over {xi, . . . that have not yet been used in Fm- Obviously, 
the distribution of F^ is the same as that of -F„,m- Furthermore, F^+i is satisfiable if and 
only if Fm is satisfiable and either x or y does not lie in the spine of Fm- Conditioned on 
the events that F^ is SAT and that S{Fm) has size s, the probability that Fm+i is SAT 
is therefore equal to 

P(SAT(F^+1)| SAT(F^) and |^(F„)| = = 1 - Q (^Q " ^) 

= 1- ^/^^'/^ . (3.13) 
4n(?T. - 1) - 2m 
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By the analogue of Theorem for the model Fn^m and the monotonicity of the expected 
size of the spine, we have that near the transition point, E{\S{F)\) = B(n^/^). Neglecting 
the difference between statements in expectation and statements in probability, equation 
( p.l3| ) therefore implies that the probability that Fm+i is UNSAT, conditioned on F„ 



being satisfiable, is G(n^/'^/n^) = Q(n~'^^^). After 0(71^^^) steps, a satisfiable formula 
therefore becomes UNSAT, giving the a finite-size scaling window of width G(n^''^) in m, 
and hence of width B(n~^/^) in m/n. This is the result in Corollary |1.2| . 

In order to explain heuristically the error bounds exp(— 6(|A„|~'^)) and exp(— 9(A^)) 
in Theorem |L1|, we proceed as follows. As a consequence of (|3.13| ), we have that 

P(SAT(F„,,)|SAT(F„))=1-^ _ _ (3.14) 



and hence 



nSAT(F.„)) = n ( 1 - ^ 4n(n-l)-2. 



fc=0 



(3.15) 



Neglecting the difference between conditional and unconditional expectations, and ap- 
proximating \S{F,)\' - by we get 

m-l . . 

nSATiF^))^]j(l-^-^E{\SiF,)n] 

k=0 ^ 

m-l X 

• (3-16) 

i.— n j' 



For A„ < and k = {1 + Xn ^/^)n, A G (— n^/^, A„), we then approximate E{\S{Fk)\'^) 
by E{\^S{Fn,p)\'^) , p = (1 + \n^^/^) /2n. Using Theorems |3]l| and ^]2| to estimate this 
probability, we have 



P(SAT(F^)) ^ ey.^(^ - 1^ ^dkE{\S{F,)\Y^ 



exp 



All? 

= exp(e(|A„|-3)), (3.17) 

giving the bound ( |1.4|) below the threshold. In a similar way, we can integrate the bound 
( p.5[ ) to obtain a heuristic derivation of ( |1.4| ) above the threshold. 

The actual proof of Theorem |1.1| in the form ( |3.9| )-( prT0| ) will be given in the next 



section, and relies heavily on Theorems |3.1| and |3.2|, which in turn are proven in Sections 



^|-^ In addition, we shall need two more technical lemmas, to be proven in Section y. In 
Appendix A, we discuss the relation between the models Fn,m and Fn,p- 
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4. Probability of Satisfiability 



In this section we prove Theorem |3.4| , which together with Appendix A estabhshes 
Theorem |1 . 1| . The lower bounds depend on the second moment estimates of Theorem 



which is proved in Section ^ The upper bounds depends on a theorem showing that with 
high probabihty there are many structures, to be called hourglasses, that are "seeds" for 
the growth of the spine. The hourglass theorem is proven in Section ^, after we develop 
suitable machinery in the intervening sections. 

To derive the bounds, we shall find it convenient to view the random formula F„ ^ as a 
process, and to couple the processes for all possible values of n and p. To do this, for each 
unordered pair of natural numbers we pick four random variables uniformly distributed 
on the interval from to 1, so that the set of all these random variables indexed by 
4 copies of (2) (sometimes denoted N*^^-*) is a set of independent random variables. As 
usual (^) denotes the set of unordered pairs of elements of the set X. A pair of natural 
numbers specify two different variables of the formula, and the four random variables 
associated with the pair correspond to the four different clauses that can be made using 
these variables, so that each possible clause C has its own independent random variable 
Uc distributed uniformly in [0, 1]. A clause C appears in the formula on n variables at 
probability p precisely when the indices of its two variables are not larger than n, and 
Uc < P- We shall call Uc the birthday of the clause C . The process F is the collection of 
all these random variables, and it defines a family of formulas by 

Fn,v= A ^' (4.1) 

C:n{C)<n, Uc<P 

where n{C) denotes the maximum index of the two variables in clause C. It is easy to 
see that for each value of n and p, the distribution of the resulting formula Fn^p is exactly 
the distribution introduced before, which justifies our using the same notation as before. 
By construction, satisfiability of Fn^p is monotone decreasing in n and p. 



Lower bounds. To derive the lower bounds of Theorem |1 . 1| , it is sufficient to consider 
the coupling above for a fixed value of n, so we suppress the variable n. Given F (i.e., Fp 
for each p), we define the reduced formula process $ = ($p)pe[o,i] as follows: $0 has no 
clauses. Start with p = 0, and increase it until p = 1. Clauses are added to Fp one at a 
time; each time we add a clause to Fp, we also add that clause to $p provided that doing 
so does not make $p unsatisfiable. 

Given $ we can define a new formula process H as follows: For each clause C appearing 
in $, set its birthday in H to coincide with its birthday in $. For each clause C not 
appearing in $1, there is some smallest value Pmin of p for which $p A C is not satisfiable. 
Pick the birthday of C in Hp uniformly at random in the interval [pmm, !)• Since the H 
process is drawn uniformly at random from the set of F processes with reduced formula 
process $, the H and F processes are identically distributed. Also note that the first time 
that Hp differs from $p is also the first time that Hp becomes unsatisfiable. 
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Let U{F) denote the number of different clauses C such that F AC is unsatisfiable. If 
F is itself satisfiable, then 




where as usual S{F) denotes the spine of F. 

Conditional on the reduced formula process being $ and Hp being satisfiable (i.e. that 
Hp = the probability that Hp^s is satisfiable is 

provided 6 is small enough. Multiplying these probabilities for p = 0, 5, 25, ... , and passing 
to the limit 6 —>■ 0, we find that conditional upon $, the probability that Hp is satisfiable 
is given by 

P[SAT(i/p)|$] = exp 

We have 



Jo . 



P[SAT(Fp)] = P[SAT(i7p)] 

= E^[F[SAT{Hpm] 



= exp 
> exp E^ 
= exp 

= exp 



1 - s 



-ds 







1 - s 



ds 
ds 



> 



exp 



1 - s 

4:(2)'^[x X and y -w y in $s 
^ W[x -w X and y y in F, 



ds 
'^ds 



where x and y are fixed strictly distinct literals. 

Next we proceed to estimate the integral. Since we are principally interested in the 
case p = 0{l/n), let us assume p = o„(l) so that the effect of the denominator of the 
integrand is negligible. Recalling that p and A„ are related by (|3.3|), and setting 



sit) 



1 + tn-^/^ 
2n '■ 
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we get 

P[SAT(Fp)] > exp 



:i + o„(i)y/3 



,1/3 



\x -w 



X and y -w y in Fs(^t)]dt . (4.2) 



) in the model -F„,m- Indeed, 



Remark 4.1. It is not hard to derive the analogue of 
starting from ( p.l5|) , rewriting 

E(\SiFk)\^ - SAT(F,)) = 4Qp(x ^ x and y ^ y in F,| SAT(Ffc)), 

where x and y are strictly distinct, and observing that by the FKG inequality, 
P(x a; and y y in Fk \ SAT(Ffc)) < P(x x and ?/ -w y in Fk), 



one gets 



m— 1 



P( SAT(F„,„)) > n P " + Oim/n^))F{x ^ x and y -w y in F,,^) 



k=0 



Proof of the lower bound of Theorem \3.4 in the subcritical regime. For t e [-n^/^,-Xo], 
the probability in the integrand in equation (4^) is 0{n~'^^^ /t^) by Theorem |3.2| and 
Remark |3.3| . Integrating, we find that 



provided A„ G 



P[SAT(Fp)] > exp 



An. 



1-e 



I A, 



□ 



Proof of the lower bound of Theorem \3.4 in the supercritical regime. By Theorems pJ] 
and ^]2|, the probability in the integrand in equation ([4.2|) is 4n^^/^t^(l + 0(£:) + oa„(1)), 
provided t E [Aq, eo'^^^^]- In the middle region t E [— Aq, Aq] we upper bound the probabil- 
ity in the integrand by B(n~^/'^), and we bound it in the left region t E [—n^^^, — Aq] by 
0{n~'^l^ jf^) as above. Integrating, we find that 

4 + o.,A„(l) 



P[SAT(Fp)] > exp 



-A. 



provided A„ E [Aq, ^o^^^'^]- 



□ 



Upper bounds. To bound from above the probability of satisfiability, we use Theo- 
rem below, which states that with high probability there are certain types of structures 
contained within the directed graph associated with a formula. 

Definition 4.2. An hourglass is a triple (f , J„, 0„) where f is a literal, and 1^ and are 
two disjoint sets of literals not containing f , such that for each x E Iv, there is a path 
X -w t) in U {v}, and for each y E Oy, there is a path f ^ ?/ in U {v}. Furthermore, 
we require that {v} U ly U is strictly distinct. We call v the central literal, /„ the 
in-portion, and Oy the out-portion of the hourglass. 
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Theorem 4.3. There are constants Aq and Eq, < Xq < oo and < eo < 1; such that for 
p = 2^(1 + Ann^-*^/^) and Aq < |An| < SoTi^^^, the following statements hold with probability 
at least 1 — exp(— 9(|A„|^). 

i) If Xn < 0, then there are at least G(|A„p) disjoint, mutually strictly distinct hourglasses 
with in-portion and out-portion each of size at least n'^^^/X'^. 

a) If Xn > 0, then there is at least one hourglass with in-portion and out-portion each of 
size 9(A„n^/^). 

The proof of this theorem will be given in Section ^. There we shall use the coupling of 
the trimmed out-graph Gp^ ^ (x) to the random graph process G„,2p-p2 (see Lemma ^]6| and 
its proof) to explicitly construct 6(|A„p) many hourglasses below threshold. To prove the 
theorem above threshold, we shall show that, when A is increased from its value below 
the threshold to its value above the threshold, a constant fraction of these subcritical 
hourglasses will merge into one giant hourglass of size 6(|A„,|^)6(n^/'^/A^). See Section |] 
for details. 

Here, we shall use the hourglasses to derive the upper bounds on satisfiability both to 
the left and to the right of the window. 

Proof of upper bound of Theorem \3.4\ in the subcritical regime. To get the bound on the 
left, we increase p from (1 — tn~^^/2n to (1 — (t/2)n^^^^) /2n, with Xq < t < CoU^^^. 
For any pair of vertices, whether or not there was a directed edge between them before, 
afterwards the probability of finding such an edge is at least {t/4:)n~^^'^ . For each hourglass, 
for each pair of literals u and v in the out-portion of the hourglass, if the clause uV v 
appears, then we claim that the central vertex and the entire in-portion of the hourglass 
is afterwards part of the spine of the formula. Indeed, let x be such a vertex. Then x u 
and X V since u and v are in the out-portion of the hourglass. But the appearance of 
the clause uVv implies that m — >■ w, so that we have x u ^ v. Together with x -w 
which is equivalent tov-^x, we conclude that x -^x. The probability of the event that 
the clause u\/v appears is at least 9((n^/^/t^)^)f:n~'^/'^) = Q{l/t^). If furthermore a clause 
appears that contains two literals in the in-portion, then the formula is not satisfiable. 
These events are independent, so the probability that they both occur is at least 6(l/t^). 
But since with high probability there are Q{t^) hourglasses, with probability at least 
6(l/t^) the formula becomes unsatisfiable. Setting t = 2|A„| gives the desired upper 

bound on the left. ^ 

□ 

Proof of upper bound of Theorem \3.4 in the supercritical regime. To get the bound on the 
right, we start with p = (l-\-tn~^'^/2n (with Aq < t < eon^^^), where the probability that 
there is no giant hourglass is at most exp(— G(t^)), and then increase it to (l+2tn~^^^)/2n. 
Any clauses of the form {u V v) where u and v are in the out-portion of the the giant 
hourglass beforehand will afterwards appear with probability at least tn~^^^. This will 
cause the in-portion of the giant hourglass to become part of the spine of the formula, 

except with probability that can be bounded by (l — tn"'*/^)^^*" "* = exp(— 9(t^)). 
Furthermore, any clauses of the form [u V v) where u and v are in the in-portion of 
the giant hourglass beforehand will afterwards appear with probability at least tn~^/^. 
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Therefore the formula will become unsatisfiable, except with probability that is again 
exponentially small in t^. Setting t = A„/2 completes the proof. —>■ 



Remark 4.4. Instead of using Theorem [4.3| , one can alternatively use Theorems |3.1| and ^[2] 
to prove that below the window, the spine has size at least E{\S{Fn^p)\)/2 with probability 
exp(— 0(1)). Increasing p as in the above proof, one obtains an alternative proof of the 
fact that, with probability at least G(l/|An|^), the formula becomes unsatisfiable below 
the threshold. While a similar argument can be used to show that above the window. 



the probability of satisfiability goes to zero, we cannot use Theorems |3]l| and |3]^ alone 
to prove that it goes to zero exponentially fast in A^. For this, we need the hourglass 
theorem. 
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5. Machinery from Random Graph Theory 

In this section we establish several bounds needed in the proofs of Theorems |3.1| and 
As in earlier sections, we use the notation Gn,p for the (unoriented) random graph on 
{1, 2, . . . , n} with edge probability p. We also consider Dn,p, the random directed graph 
on {1,2,..., n} in which each oriented edge is chosen independently with probability p. 
We use the symbol L^p{x) to denote both the set of vertices ?/G{l,2,...,n} that can be 
reached from a vertex x in the random digraph -D„^p, and the set of vertices y G [n] that 
can be reached from a vertex x in the digraph corresponding to a random 2-SAT formula 
Fn,p. If the difference is not clear from the context, we shall use the notations L^^^(x) 
and (x) to distinguish the two cases. We begin this section with a basic lemma which 
is implicit in the work of Karp [[Kar90 . 



Lemma 5.1. The probability that in the random digraph Dn.p every vertex can be reached 
from a given vertex is precisely the probability that the random graph Gn,p is connected. 

Proof. We may assume that the vertex in question is vertex 1. First, we shall inductively 
define a subtree T = T(D„ p) of -D„ p, rooted at 1, with each edge oriented away from 
1. To this end, set Xq = {!}, Iq = 0, and let Tq be the subtree of Dn,p with the single 
vertex 1. Suppose that we have defined a pair of subsets of {1,2, . . . ,n} with 

Yi C Xi, and a subtree Tj with V{Ti) = Xi. (We think of Yi as the set of vertices we have 
"exposed", i.e. tested for outgoing edges, and Xi as the set of vertices we have selected so 
far.) If Xi = Yi then Tj is our tree T. Otherwise, let Xi be the smallest element of Xi \ Yi. 
Let r+(a;j) denote the set of vertices in {1,2,..., n} that can be reached by single edges of 
Dn^p that are oriented outward from Xj. Now set Xj+i = XiUT~^{xi), Fj+i = YiU{xi}, and 
take Tj+i to be obtained from Tj by adding to it the vertices in Xj+i \ Xi, together with 
all the edges from Xi to Xj+i \ Xi. The vertex set of the subtree T of Dn^p constructed in 
this way is clearly L„ ^(1); in particular, L„p(l) = {1, 2, . . . , n} iff V{T) = {1,2,..., n}. 
Since the edges of T are oriented away from 1, we may view T as an unoriented tree. 

Now, let us construct a subtree T' = T'{Gn,p) of the random graph Gn,p rooted at 1 by 
precisely the same algorithm. The lemma will follow if we show that 

P( T(D„,p) = To ) = P( T'(G„,p) = To ) (5.1) 

for every tree Tq with vertex set {1,2,..., n}. 

Given Tq, we can define the Xj's as above. We have T{Dn^p) = Tq if and only if the 
random digraph Dn^p is such that 

1) it contains all the edges of Tq (oriented away from vertex 1), 

2) it contains no edge oriented from Xi to {1,2, . . . ,n} \ Xj+i. 
Similarly, T'{Gn,p) = Tq if and only if the random graph Gn,p is such that 

1) it contains all the edges of Tq, 

2) it contains no edge from Xi to {1, 2, . . . ,n}\ Xj+i. 

Notice that the probability that -D„,p contains a given set K of oriented edges, and no 
edge of a second set K' of oriented edges, is pl^l(l — p)'^'' provided that K f] K' = ^. 
Moreover, this is equal to the probability that Gn,p contains a set K of unoriented edges. 
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and no edge of a second set K' of unoriented edges, provided that Kf\K' = ^, \K\ = \K\ 
and \K'\ = \K'\. Thus relation (B.ll) holds, and we are done. □ 



Returning to the 2-SAT problem Fn^p, let x be a fixed literal. The probability that, 
in a random 2-SAT formula -F„,p, the set L^p{x) consists of k strictly distinct literals is 
trivially independent of x; we shall denote it by Pn,p{k): 

Pn,p{k) = n{Kp{x)\ = k}n is s.d.}). (5.2) 

Lemma 5.2. For all n, k and p, with 1 < k < n and < p < 1, we have 

Pn,p{k) = 2''-' (^^ - i) ~ p)2fcn-3fcV2-fc/2p^ ^^^^ COUUectcd ). (5.3) 

Proof. Let X be a set of k strictly distinct literals with x E X. For y, z G X, the 
dual of the implication y z involves no literal in X. Therefore the probability that 
-^F„ p(^) = ^ is -Pa Pb, where Pa is the probability that every vertex of the random digraph 
Dx,p can be reached from x and Pb is the probability that the random 2-SAT formula 
Fn^p contains no implication from the set /(X, X^) = {y ^ z : y E X, z ^ X}. 

By Lemma we have Pa = F{Gk,p is connected), so we turn to the task of calculating 
the probability Pb- 

Note that there are k{2n — k) implications in the set /(X, X'^). However, a 2-SAT 
formula F^^p contains none of the k implications y —>■ y, y E X . Also, iiy E X and z E X, 
then y ^ z and z ^ y are dual implications, i.e. F„ p contains y ^ z if and only if it 
contains z ^ y. In fact, both implications y ^ z and z ^ y belong to I{X,X'^) if and 
only a y E X, z E X and y y^z. Hence I{X,X^) contains (fc^ — k)/2 dual pairs, so that 
the probability that contains no implication from /(X, X^) is 

^ (^l _ p-jfc(2n-fc)-fc-(fc2-fc)/2 ^ _ ^-j2fcn-3fc2/2-fe/2 

Therefore, 

P(L^^^(x) = X) = r{Gk,p is connected )(1 -p)2"'^'/2-^/2. (5.4) 
Since there are 2^^^(2zl) choices for the set X, the lemma is proved. □ 

In order to transform Lemma |5.2| into a form suitable for applications, note that the 
probability that Gk,p is connected is trivially expressed in terms of f{k,m), the number 
of connected labelled graphs with k vertices and m edges, and that f{k,m) has a good 
and fairly simple approximation when m — is not too large compared to k. To state this 
approximation, let us define an array of numbers Ck^e by f{k, k — 1 + i) = Ck/k^~'^^^^/'^. 
The somewhat peculiar choice of the parameter i is justified by the fact that c^/ 7^ 
if and only if < £ < — k + 1. Also, if i is not too large then f{k, k — 1 + i) has 
order about /^'=-2+3^/2^ More precisely, since f{k,k — l) is just the number of trees with k 
labelled vertices, by Cayley's theorem we have c^^o = 1- Also, 

c,,i = (l + 0(fc-i/2))(7r/8)i/2, (5.5) 
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Ck/ < 1 for all /c > 1 and i >0, and for all 2 < £ < (2) ~ ^ + 1 ^^'^ some c < 00, we have 

Ck,i < {cllf\ (5.6) 



see 



Bol85|| . When i is fairly small compared to A;, there are rather detailed estimates for 



c^p To be precise, Wright ( ||Wri77|| , [[Wri80|| ) showed that for 2 < £ = o(/c^/^) we have 



Cfc,, = 7(3vr)i/2^ 



12(£- 1) 



(5.7) 



where 7 = 0.159155 ... is the the limit of a certain bounded increasing sequence. Later 
Meertens proved that 7 = l/(27r) (see [|B(J1V190|| ). 

The probability that Gk,p is connected and has — 1 + £ edges is just 



so Lemma |5.2| has the following immediate consequence. 

Corollary 5.3. For all n, k and p, with 1 < k < n and < p < 1, we have 



Pn,p{k) 



1 fn 
n\k 



{2pkf'\l-p) 



2kn-k^-2k+l 



Sp{k), 



{5.t 



where 



(^)-^+i 



If k^^'^p/ {1 — p) is bounded, then 



(5.9) 



Sp{k) = 1 + 



IT k^/'^p 
\ — p 



l + 0{k 



-1/2^ 



^3/2 



P 



1 — p 



(5.10) 



where Oo(-) is the Landau symbol introduced at the beginning of Section |^. 



To estimate Pn^p{k), we relate it to known bounds on related events in random graphs. 
To this end, we recall the definition of the trimmed out-graph Dp{x) in Section ^ and 
its relation to the random graph G'„,2p-p2 on n vertices with edge probability 2p — p"^, see 
Lemma Recall that the vertex set of Dp^^{x) is denoted by L^p{x). We define 



Q„,p(fc)=P(|L+^(x)| = fc). 



(5.11) 



By Lemma |2.6| part (iv) 



QnA^^) =H\Cn,2p-p^{x)\ = k} 



(5.12) 
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where Cn,2p-p'^{x) is the connected component in Gn,2p-p'^ containing a fixed vertex x. For 
all n, k and p, with 1 <k <n and < p < 1, we have 

Qn,p{k) = ^ (1 - 2p + p2)fc(n~fc)p( ig connected ) 



1 fn 
n\k 



{{2p - p^)k)''-\l - j9)2'="-'^'+'='-^'+2^2p-p2(A;). (5.13) 



In Section we shall also need bounds on the probability Rn,p{k) that C„.2p-p2(x) is a 
tree of size k, 

Rn,p{k) = P {\Cn,2p-p2{x)\ = k and C„_2p-p2 (a;) is a tree) . (5.14) 

Recalling the derivation of Corollary |5.3|, we immediately see that Qn,p{k) and Rn,p{k) are 
related by 

Qn,p{k) = Rn,p{k)S2p^p2{k). (5.15) 

The following lemma will be used to turn well-known bounds on Qn,p{k) into bounds 
on Pn,p{k). 

Lemma 5.4. For all < p < 1, 

Pn,p{k) < Qn,p{k). (5.16) 

If P < 1/2 and k^^'^p is bounded, then 

< QnAk) - Pn,p{k) = 1 + 0{k-'") + Oo (^) Pn,p{k) . (5.17) 

If P ^^/"^ O'lT'd k^^'^p > 1, then 

Pn,p{k) = o(£o2-'''Q„,p(A;)), (5.18) 

where 

4 = Uk) = min |^^|^^,n^/^| . (5.19) 



Proof. By Lemma 2.6, Lt (x) is strictly distinct if and only if Lt (x) = Lt (x). As a 
consequence, 

P^^k) = nL^J^) = L+Jx) and \L+^Jx)\ = k) 
= nL^A^) = L+^Jx) and \L+^Jx)\ = k) 
<n\LtS^)\ = k) = Qn,p{k), (5.20) 



28 B. BOLLOBAS, C. BORGS, J. T. CHAYES, J. H. KIM, D. B. WILSON 

which proves ( |5.16| ). Rewriting ( p.l3[ ) in the form 



1 /n 
n\k 



\2kn~k^~3k+2 



s. 



2p—p- 



= PnAk){l - p/2)'-\l - pr''+'S2p-Ak)/Sp{k) (5.21) 
>PnAk)S2p-p^{k)/Sp{k) (5.22) 

and observing that Sp{k) is a monotone increasing function of p/{l — p), and hence of p 
(see (|5.9| )), we have S2p-p2{k) > Sp{k), obtaining an alternative proof of the bound ( |5.16|) . 

If p < 1/2 and k^^'^p is bounded, then both p/^p/(l —p) and k^^"^ {2p — p"^) / {1 — 2p + p'^) 
are bounded by a constant times k^^'^p. By ( |5.10|) , we therefore have 

S2p-p^{k) , , pnlt^/'^p 



Sp{k) 



1 + 



p 



and hence by ( |5.21| ) 

QnM=Pn,p{k){l + 0{kp)) ( 1 + 

As a consequence, 

Qn,p{k) ^ 



l + 0{k 
IT k^/'^p 



1 — p 



l + 0{k- 



-1/2' 



(5.23) 



^3/2 



p 



p 



1 + Oik 



Pn,p{k) 

which imphes the bound (|5.17| ). 

In order to prove ( p.l8| ), we decompose Sp{k) as 



,^3/2p. 



,j^3/2p., 



,p/2py 



o<e<eo 

= s'^ik) + s';{k) , 

where Sp{k) is the first and Sp{k) is the second sum above. If £ < n^/^ then i = o{k^/^), 
so by ( |5T7| ) we have 

gl/2p/2p y,„i)_ 



S'p{k) = Oi J2 



12l/2(£_ 1)1/2(1 _p) 



P 



2<e<eQ 

By taking the ratio of successive terms in the series 



el/2^3/2p 



-{£-1) 



gi/2p/2p e , e^/^k^/^p \ 1 



12 



1/2(1 _p)j £1/2 £ 



-(i-\)l2 



121/2(£_1)1/2(1 Vl2l/2£l/2(l_p), 

- V 121/2(1 -p); £1/2 

we see that the summand is an increasing function of I for 2 < £ < £o + 1- Since the 
(£o + l)st term is in the sum for Sp{k), we then have 
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To bound S''{k), we note that 



2p 2p — 

1 — p 1 — (2p — p^ 



so that 



< 2" 



/fc3/22pN 



E/ ft. ' ^p\ 



2-io 
2-^0 5// 



2p—p- 



and therefore 



Combined with (|5.22| ), this gives the desired bound ( 5.18 ). 



□ 



Our later estimates rely heavily on bounds on the expectation of the size of the com- 
ponent of a given vertex in a random graph, Lemma below. Although sharper forms 



of these bounds were already proved by Bollobas | |Bol84| , Luczak ||Luc90|| , and Janson et 
al. [|JKLP94|] , these previous estimates were proved only for a restricted range of A (e.g. 
A < n^/^^), whereas we require the full range (i.e. A„ < ^o'^^'^^)- These estimates turn out 
to be rather involved; the proof is given in Appendix 0. 

We remark that the bound we shall obtain in ( p.24| ) below is closely related to the 
order of the giant component: if A„ — 00 and A„ = o(n^/^) then, for p = (1 + A„?2^^/^)/n, 
with probability tending to 1, the random graph Gn,p has a unique giant component with 
(2 + o(l))A„n^/^ vertices. 

Lemma 5.5. There are constants c, Sq and Aq, c > 0, < < 1? 0'''^d < Aq < 00 
with the following property. Let Aq < A„ < Equ^^^ and, as before, set e = \nn~^^^ and 
p= [1 + \^n-^/^)/2n. Then 



QnAk) = m{i + o{i/xl)) 



(5.24) 



fc>A„n2/3 



and 



fc=[n2/3/A„J 



-cA„^-l/3-j 



(5.25) 
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6. Moment Estimates 

In this section we shall bound the moments of the number of literals x for which the set 
L^p(x) consists of strictly distinct literals. Recall that Pn^p{k) is the probability that, for 

a variable x, the set L^p{x) consists of k strictly distinct literals. We use Corollary ^]3] 
to get a good estimate of Pn,p{k). Note that 



exp[-l/(m + 5,)]^^ ,^^(^ I 



k-l 
1=0 



n 



where < 5^ < 1 for each k. Recalling that 2np =! + € = ! + XnU with |£:| < Eq, 
£0 < 1, we write 



and 



and obtain 



1(1 M ^ of 



^ 2n~^ ~ \n) Qii? ~^ ~ Kin?) ~ 



Expressing 

(1 - p)2fc-fc'-2fc+i _ gxp I - (p + Q{p^)){2kn - A;2 - 2A; + 1) I 



exp <^ - A; - efc H \ h ^ - Q{p^kn) \. 

I 2n 2n n J 



we therefore get 

1 



Pn,pik) 



2pnV2^ A;3/2 



(6.1) 
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Lemma 6.1. If k/n^^^ is hounded, then Pn^p{k), Qn,p{k) and Rn,p{k) can be rewritten in 
the form 



exp { - ^ (l - e(.) - e {^) ) H- Oo (5^) - 8(1/*)}. (6.2) 



'kn 



Remark 6.2. It is evident that the multiphcative error terms 1 - e(e) - Q{kn-'^^^ / \n) 
above can be made arbitrarily close to 1 if e is small enough and A„ is large enough, i.e. if 
1 ^ A„ ^ 72^/3, or equivalently n~^^^ <^ e <^ 1. Likewise the additive error terms can be 
made arbitrarily small when 1 -C /c -C n"^^^. We follow the standard convention that "if 
a -^b then c = (1 + o{l))d" means that the o(l) error term can be made arbitrarily small 
by taking the ratio b/a large enough, and in our earlier notation may be re-expressed as 
"c=(l + o,/,(l))rf." 

Proof of Lemma \6. 1\ . With the assumption that k/n^^^ is bounded, we can use our bound 
( gnop on Sp{k) together with the bounds k/n < k^/'^/n, k^/n^ = 0{k^/n'^) = Oo{k^^'^/n), 
and k^/n'^ = Oo{k^^'^/n) to rewrite (|0| ) as 

P„,,(A:) = exp { - ^(1 - 0(^)) + 5r + ®° (^) " ®^^/^)}- ^6-3) 



2pnV2n k'^/^ 

Note that the ratio between the second and first terms in the exponential is k/{ne) = 
A;/(A„n^/'^), so we can further rewrite ( |6.3|) as (|6.2|) . The estimate (|6.2| ) for Qn,p{k) follows 
immediately from the estimate for P„ p(/c) and ( |5.23|) , and the estimate for i?„ p(/c) follows 
from that for Qn,p{k) and (|5.15|) . □ 

Lemma 6.3. There are constants c, Eq and Aq with c > 0, < eo < 1 (^iT-d < Aq < oo, 
such that the following statements hold for a > 1/2 and Aq < |A| < Equ^^^. 
i) If Xn < 0, then 

/vol '^-i/s 

fc>n2/3/|A„| 

where the constant implicit in the Landau symbol O(-) depends on a. 
a) For both positive and negative A„, we have 

fc<n2/3/|A„| 



-c|A„| 



(6.4) 



2pn 



'2tx 



(6.5) 



where the Oe,A„(l) term depends upon a, and for fixed a becomes as small as we like if W, 
is large enough and e = Xn/n^^^ is small enough. 



Remark 6.4. Here, as in the rest of this paper, T denotes the gamma function, which 
interpolates factorials. Recalling that r(l/2) = ^/7^, the lemma immediately implies that 
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below the window the expected size a component of a given vertex in G'„,2p-p2 can be 
estimated by 



E{\C^,2r>-A^)\) = ^^(1 + Osm{1)). (6.6) 
Note also that ( |6.4|) implies that below the window 

(6.7) 

Proof of Lemma |6'. We start with the proof of ii). The limit of the summation is 
n^/^/|A| = n^^^/\e\; we first sum the portion up to 1/e separately. For this portion, the 
sum is 

J2 k^Qn,,{k)=0{{l/er~'l^), (6.8) 

fc<l/|e| 

where here and throughout this proof, constants may depend upon a. 

Referring to our expression (|6.2|) for Qn,p{k), we see that for the remainder of the terms 
in the sum, the additive error terms in the exponential tend to zero when k/n^^^ < 1/|A| is 
small enough and k > l/\e\ is large enough, and so they contribute only a multiplicative 
error of 1 + o(l). The multiplicative error terms in the exponential can also be made 
arbitrarily close to 1 by taking |A| large enough and \e\ small enough. Therefore we have 

^-±^^B{2-S)< Yl k^QnAk)<^-^^^B{2 + S) (6.9) 

^^^^^^ l/|e|<fc<ni/3/|,| ZpnVZTT 



where S = 0£^a„(1) and 



B{t)= ^'''^'^ exp 

l/|£|<fc<ni/3/|£| 



ke 



21 



(6.10) 
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Since the summand in ( |6.10| ) is unimodal, we may approximate the sum with an integral 



B{t) 



ni/3/|e| 
ni/3|£|/t 



^a-3/2 



exp 



t 



dk + o( maxr~^/2 



exp 



t 



tu 

72 



a-3/2 , , 

, A du ^ 
exp[-Mj — — + U 



-1 a-3/2^ 



t 



-|a-l/2 ^|A|/t 



^a-3/2 gxp[-M] du + 



\e\/t 



a-3/2^ 



a-1/2 



[r(a-l/2) + o,,A„(l)] + 



a-3/2^ 



-1 a-1/2 



[r(a-l/2) + o,,A„(l)] 



(6.11) 



where the Oe,A„(l) depends on a. Combining ( |6.8D , (|6.9|) and ( |6.11| ), we get (|6.5| ). 

To prove i), we use the well know fact that the cluster size distribution in Gn,pi with 
p = 2p — p^, is stochastically dominated by a birth process with binominal offspring 
distribution Binomial(?T,,p), which in turn is stochastically dominated by a Poisson birth 
process with parameter ?T,log[l/(l —p)\= 2np{l + 0(l/n)). Writing this parameter as 
1 + £ , we therefore get the estimate 



^ ^ k Qn,p(yk) < Y 



k> 



fc>n2/3/|A„| 



< 



1 fc*^ / £^ \ 



1 + 0(1/^) ^a-3/2 



exp 



fc> 



^2/3 



fe2 



:i+on(i)) 



(6.12) 



Bounding the sum on the right hand side by O Q(^^^^)A^^]° e 
estimate (|6.4|). 



we obtain the 

□ 



Lemma 6.5. There are constants c, Eq and \q with c > 0, < Cq < 1 and < Aq < oo, 
such that for a > 1/2 and Xq < \X\ < Equ^^^, we have 



^ ^ k Pn,p{k) 



k=l 



l + o,,A„(l)r(a-l/2) 
2pn 



27r 



a-l/2 



(6.13) 
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and 



fc>n2/3/|A„| 



'^-^/^ fO(e-®(l^"l)) zf A„<0 



0(e- 



if Xn > 0, 



(6.14) 



where the constants implicit in both the Oe^x„{l) term and the O term depends upon a, and 
where e = \nn~^/^ as usual. 



Remark 6.6. The particular values of a that we shall need are a = 1, a = 3/2, and a = 2. 
For these values, we have 




a = 1 



2/n E-^ a = 3/2 



\e\ a = 2 , 

where as before the summation can range over all k, or over all k < n^/^/|A„|. 



(6.15) 



Remark 6.7. Using the same trick as in (|6.7| ), we can write 



k 



Pn,p{k) — -j^Pn,p{k) 



fc>n2/3/|A„| 



fc>n2/3/|A„| 



0(e-®(l^"l)) if A„<0 
0(e-e(^"''')) if A„>0. 



(6.16) 



Proof of Lemma \6. 3^ . To prove the lemma, we again consider three regions of fc: k < 
n^^^/l^nl, n'^^^/\Xn\ < k < n^/^|A„|, and k > n^/^|A„|. In the first region, we use (|5.23|) to 



approximate Pn,p{k) by Qn,p{k). Combined with Lemma |6.3| , this gives 



r(a- 1/2). 



(6.17) 



For A„ < 0, we combine the second and the third region. Using the fact that Pn,p{k) < 
Qn,pik) by (|5.16|) , we then use Lemma [67^ to obtains the bound (|6.14| ) below threshold. 
Above threshold, the contribution from the second region is 



^ ] k Pn,p{k) ^ ^ ^ k Qn,pik) 

n2/3/|A„|<fc<n2/3|A„| n2/3/|A„|<fc<„2/3|A„| 

< (n2/3|A„|)« Yl Qn,p{k) 

n2/3/|A„|<fc<n2/3|A„| 

<(n2/3|A„|)"exp[-e(|A„|)]n^^/3 
-2^2/3n"-V2 



by (m) 



X2 



X exp[-e(|A„|)] . 



by (Ell) 

(6.18) 
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In the third region we use (5.18) to write 



J2 k''Pn,p{k) = o( Yl W ^-''^'^ QnAk)) ■ (6.19) 



fc>n2/3|A„| 



•fc>n2/3|A„| 



Consider the prefactor, i.e. the summand ignoring the factor of Qn,p{k). Provided £o(^ + 
1) < n^/^, recalhng the definition of ^Q in ( |5.19|) , we can differentiate the logarithm of the 
prefactor to get 



d 

dk 



log 



r+y/(i2(i-p)^)2 



-A:V/(12(l-p)2) 



" + ' -(log2)3fc^ 



k 



12(1 -p)2 



which will be nonpositive provided 

4(a + 3) , o n , ,9 

-^^-^<ky = —il + e)^ 
log 2 An^ 

which will hold if \Xn\ is large enough, where "large enough" depends upon a. Provided 
that these conditions are met, the prefactor takes on its largest value in the first term, 
k = n^^^\Xn\. Eventually, if k gets large enough, io{k) may stop increasing, and take on 
the value n^^^. If a > then the prefactor will then increase up to n""^ 

i/52-nV5^ Thus the 

maximum value of the prefactor is 



max 



(48 + o,,,„(l)) 



Since by assumption e = A„n < 1, we have n > G(A„^), so the second term in the 
max is n"" exp [-e(ni/5)]exp[-e(A„^/^)]. Thus we can write the maximum prefactor as 

max 1^(2/3)'^ exp[-e(A„')], n*^ exp[-e(ni/^)] exp[-e(A„'/')] } < n^^/^^'' exp[-e{Xj/% 



Upon substituting the above expression into (|6.19|) , we find 



n 



(2/3)a 



exp[-e(A„=^/^)]Q„,p(fc) 



fc>n2/3|A„| 



A:>n2/3|A„| 
<^(2/3)a-l/3g^p[_@(^^3/5)] 

-2^2/3n-V2 



A. 



exp[-e(A„=^/^) 



(6.20) 



where we used ( |5.24| ) to get the second line. 

The bounds ( |6.18| ) and ( p.20| ) imply ( |6.14| ) above threshold, and ( |6.14| ) and ( |6.17| ) imply 
the estimate ( |S.13[ ). □ 
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7. Expected Size of Spine 

After the preparation of the last three sections, we are ready to prove the bounds 
and 



in Theorems 



Proof of Theorem \3. 1\ . We begin with a derivation which holds for e of both signs, and 
later distinguish e < from e > 0. Since J2k>i Qn,p{k) = 1, we have 

f(x ^ = 1 - VPn,p(fc) = V (QnAk) - Pn.Ak)) 

By (i^, 

{Qn,p{k) — PnA^)) ~ 



k>l 



k< 



■2/3 



p. 



Pn,p{k) 



k< 



a/|(1 + oa„(1)) Y1 p[k'^' + 0{k)]P^Ak) 

k<^ 



(l + OA„(l)) 



(l + o.,Jl))^v|^ + 0(./.) 



by (CT) 



1 + 0, 



'e,\„ 



1 + 0, 



4n£2 4ni/3A„2 

So far everything we have done in this section holds for both the subcritical region and 
the supercritical region. We now consider these regions separately. 
Subcritical regime: 

We use ( |6.7|) to sum over the remaining values of k: 



fc>n2/V|A„| fc>n2/V|A„| 

Putting these two ranges together we get 

k 

— ^ ^ {Qn,p{k) Pn,p{k)^ + ^ ] {Qn,p{k) Pn,p{k)^ 
fe<n2/3/|A„| n2/3/|A„|<fc 



l + Oe,A„(l) 
1 + 0.,A„(1) 



+ 



-c|A„| 



1/3 
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which completes the proof of 

Supercritical regime: 

If k is "mid-size," we can apply ( |5.16|) and (|5.25| ): 



□ 



n2/^/A„<fc<A„n2/3 n2/3/A„<fc<A„n2/3 



If k is "large," we proceed as in the proof of ( |6.2CI| ) to write 

J2 (gn,p(A:)-Pn,p(A;)) = (l-0(exp[-e(A.^/^)])) ^ Q.,,(A;) 

= t9(e)(l + o,„(l)) by (^) 

= 2\nn-^'\l + oxA'^)+0{e)) . 
Putting these three ranges of k together, we find 



E + E + E 

fc<n2/3/A„ n2/3/A„<fc<A„n2/3 fc>A„n2/3 



{Qn,p{k^ Pn,p{k)^ 



+ ^(e)(l + OA„(l)) 



= t9(5)(l + o,„(l)) 
= 2A„n-i/3(l + OA„(l) + 0(£)) . 
which yields ( |3.5|) and completes the proof of Theorem 



□ 



8. Variance of the Size of the Spine 

In this section we shall prove the bounds (p.6|) and (|3.7|) on the variance of the size of 
the spine given in Theorem |3.2| . First, we note that the lower bound is immediate from the 
Harris-Kleitman correlation inequality (see |[Har60|| , |[Kle66|| ), which was later generalized 
to the FKG inequality |[I''KG71|| . We use x -w x as a shorthand for x x, and if M is a 



set of literals, x x means that there is a path from x to x using only literals in M. 



M 



Lemma 8.1. For strictly distinct literals x,y, we have 



Hx '^x,y-~-^y) -F(x -^x)^ < Pnp(k)(F(x x) - - — -F(x x) 

^ n n ' ^ n ' ^ — ^ V n n — \ n-k ' 

k>l 
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Proof. Let Pn^p{k;y) denote the event that L^{y) is strictly distinct, and \L^{y) \ = k. So 
in particular P(P„p(fc; y)) = P„_p(fc). Using the resolution of the identity 

oo 
k=l 

where Ia denotes the indicator function of the event A, we can decompose P(x x) in 

n 

two different ways to obtain 

F{x ^ X, 2/ ^ y) + V P(a: ^ x, h,,{k- y)) = P(x ^ x) (p(y ^ y) + V P(P„,p(A;; y)) 
SO that 

Flx^x^y-^y) - F(x ^ x)F(y y) =y (f{x ^ x) ~ F(x ^ x\Pn,p{k;y)))p^^p{k). 

k 

(8.1) 

To estimate the probability on the right, consider the event Pn^p{k;y). With probability 
{k — l)/(n — 1) either x or x is in L^{y). If x G L^{y) then x -/^ x. If x G L~^{y), the 
situation is more complicated, so we shall bound the probability below by 0. If L~^{y) 
contains neither x nor x, then any path from x to x avoids literals in L^{y). In this case 
we may as well explore the out-graph L+(x) restricted to avoid the variables in L^{y); 
with probability P(x x) the restricted out-graph will contain x. Thus we conclude 

P(x ^ x\Pn.pik; y)) = ^^P(x ^ x) + ^^^P(x ^ x|x G L+ (y) , Pn,pik; y)) 

n ' n — 1 n-k in — I n 

> "^—Jl^lx ^ x). 
n-1 ^ n-k ' 

Substituting this into ( |8.1| ) proves the lemma. □ 
Next we relate the probabilities of the events x ^ x and x x. 

n—k n 

Lemma 8.2. // A„ = en^^^ is large enough, and e = A„n~^/'^ is small enough, then 



Pfx x) - F(x ^x) < < 

^ n+1 ' ^ n ' 



2n\\n\^ 
2 + o.,A„(l) 



z/ A„ < 
A„ > . 



n 

Proof. Suppose x -/^ x. Let X denote L^p(x), which then must be strictly distinct. We 

n 

shall consider four cases depending on whether or not X — » Xn+i and whether or not 

X — s> Xn+l- 
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Case 1. X ^ Xn+1 and X -/^ Then x x. 

n+1 

Case 2. X ^ Xn+i and X — > Xn+i- Then x x. 

n+1 

Case 3. X — > Xn+i and X -f^ a^n+i- It is clear that if where [X] denotes 

X U X, then x X. Suppose that conversely x x. Since x -/^x, either Xn+i or afn+i 

n+1 n+1 ^ 

is in the path from a; to a; in [n + 1]. The first such occurence must be Xn+i, so we have 
Xn+i ^ X, and its contrapositive x ^n+i- Since x ^n+i within [n] U {xn+i}, it 

n+1 n+1 

must be that Xn+i occurs in the path from x to Xn+i- In particular Xn+i Xn+i- We 

n+1 

may assume without loss of generality that Xn+i and Xn+i occur only at the endpoints 
of this path. If a literal in X occurred in the path from Xn+i to Xn+i, consider the last 
such one. The next literal cannot be by assumption, nor can it be Xn+i, since this 
occurs only at the beginning of the path. So the next literal would have to lie inside X, a 
contradiction. If a literal in X occured in the path from to we could take the 

contrapositive and similarly derive a contradiction. Thus Xn+i ^n+i using only literals 
in [n + ll \ \X], i.e. Xn+i ^ Xn+i- Thus conditional upon L'^(x) — X, we can write 

[n+l]\[X] 

X Xn+1, X 7^ Xn+1, X X iS X ^ Xn+1, X Xn+1, Xn+1 Xn+1 
¥[X Xn+1, X 74 Xn+1, X X \L+{x) = X] = 

P[X ^ a;„+i]P[X 74 x„+i]P[x„+i ^ Xn+i] 

[n+l]\[X] 

since the events on the right and the event L^{x) = X are determined by pairwise disjoint 
sets of variables. 

Case J^. X -f-^ Xn+1 and X — > Xn+i- By symmetry, cases 3 and 4 have the same probability. 
Putting these four cases together we see 



^{x ^ x\Ll,{x) = X) = [1 - (1 - ' + 2 [1 - (1 - (1 - p)l^lT 



\x x\ 



n+1 "■'■^^ / L X 'A L X ' J X n+l-|X| 



<p^|X|^ + 2p|X|P[a; x\. (8.2) 

n+l-|X| 



As a consequence. 



P(x x] — P(a; x] = Pfa; x\x x\ 

^ n+1 ' ^ n ' ^ n+1 n ' 



J2 HlX,{x) = X)F{x x\LX,{x) = X) 

Xc[n] 
X strictly distinct 



<y^Pn,p{k){p''k'' + 2pk¥[x x]) 

'—^ n+1— K 

fc 

< p2 ^ PnM^^ + 2pP[2; -^AY. PnAk)k. (8.3) 
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Hence by ( |6.15[ ), we have 



^ --LI ^ ^ n ^ 2pn \e\^ n 



1+0(1) 1 



71+1 



2pn \e 

1 + 0(1)^1 + 0(1) fl±^ ifA„<Oby(| 

^1^1 \(2 + o(l))£ ifA„>Oby(| 

fl /f9^2|^|3l 

[l + o(l)] X 

[l + o(l)]x 



l/[2n2|e|3] if A„ < 0, 
if A„ > 0. 



2/n 

■l/[2n|A„|3] ifA„<0, 
2/n if A„ > 0. 

In the above, all of the o(l) terms are 0£^a„(1)- 



□ 



Corollary 8.3. Provided k < n^'''^/|A„|, we have 



Pr f X -w x) — P f X x) < 

^ n ^ n-k ' 



(1 + o.,a„(1)) 

(1 + 0.,A„(1)) 



k 



2n\\n 

2k 
n 



- if Xn < 

lfXn>0. 



Proof. We seek 



n-l 



P(x -w x) - P(x x) = V P(x x) -P(x x) 

^ n ^ ^ n-k ' ^-^ ^ m+1 ' ^ m ' 



in=n—k '- 



and need only show that each summand is well-approximated by P(x x) — P(x x). 

n+l ' ^ n 

Define A„' by 

1 + A„n-i/3 1 + A„'m-i/3 



P 



2n 



2m 



so that 



and thus 



- [1 + Kn-'/'] - 1 = A„' 



m 



-1/3 



m — n 



m\4/3 



An, • 



Since m = (1 + 0£,a„(1))'^, and |n — m| < A; ^ n^/^, it follows that A^' = (1 + Oe,A„(l))A„. 
Thus when we apply Lemma p.2| with m and A„' rather than n and A„, we obtain an 
answer that differs by a factor of 1 + Oe,A„(l), as desired. □ 
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Proof of Theorem \3.3[ . We can now complete our estimate of the covariances. For strictly 
distinct literals x and y we have 

Cov(x x,y y) = F(x x,y y) — P(x x)F(y -w y\ 

n n n n n n 

a; s) P(x x) 



by Lemma 



k>l 



^ — ^ n — \ n ^ — ^ n — 1 V n ' 

k>l k>l 

< P(x ^) ^ ^ Pn,p{k)k + ^ Pn,p{k) {^{x -^x) - P(x x) 



As a consequence, 
Cov(x x,y -^y) < 

n n 

< P(a; ^ a^) 1 V Pn,p{k)k + V P„,p(A;) (p(x ^ x) - P( 



k>l 



k>l 



— irix X 

n— A; 



)) 



fc<n2/3/|A„| fc>n2/3/|A„| 

l + o{l))k 



Pn,p{k)lF{x -w x) - P(x ^ X 



)) 



<P(x^x)i±;^+ 5^ P„,,(fc) 



fc<n2/3/|A„| 



2n|A„|3 
(2 + o(l))A; 



n 



J2 Pn,p(A;)P(x ^ x) 

fc>n2/3/|A„| 



1 + 

{2 + o{l))e 

1 + 0(1) 

2 + o(l) 

ne 



l + o(l] 



ne 



+ 



1 + 0(1) 
2n\Xn\^e 

2 + o(l) 



exp[-e(A„)]n 



-1/3 



l+o(l) 



exp[-0(A„=^/^)]r2-i/3x (2 + o(l))£ 



1 + 0(1) 
2n2/3A„4 

2 + o(l) 
n2/3A„ 



where the upper entry applies to the subcritical regime, and the lower entry applies to the 
supercritical regime. In the above, the o(l) terms are all Oe,A„(l)- This gives the variance 
bound above threshold. The bound on the second moment below threshold follows by 



combining the above variance bound with Theorem 



□ 
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9. The Existence of Hourglasses 
In this section we prove Theorem |4.3|, which was central to the proof of the upper 



bounds in Theorem 



Proof of Theorem \4-.^ (i) on the existence of many hourglasses in the subcritical regime. 

Let t be a large positive number, but still small compared to n^^^, and let p = (1 — 
tn~^^^) /2n. We shall construct a process for growing hourglasses one by one. However, 
as we deplete variables, the distribution changes, so that the hourglasses so constructed 
are not identically distributed. We therefore use the following two variations of this naive 
procedure: In the first process, instead of drawing from n variables, we draw only from 
n' = n — tn^/'^ variables, so that we have a "buffer" of tn?^^, and replenish the variables 
as necessary. However, even this process can lead to trouble in the unlikely event that 
we have a very long run which uses up our entire reserve of variables. So we construct 
another process which aborts the growth of an hourglass when it would use up too many 
variables. 

To be explicit, consider the trimmed out-graphs and trimmed in-graphs of various 
literals, where the in- and out-graphs are restricted to sets oi n' = n — tn^^^ variables. 
Recall from Lemma |2.(j| that the unoriented projection of the trimmed out-graph ^ (x) 

of a literal x is identically distributed to the connected component Cn',2p-p2(x) in Gn',2p-p^- 
We shall follow the convention that no matter how many trimmed out-graphs or trimmed 
in-graphs we have explored in the past, when exploring another in-graph or out-graph, we 
shall always restrict our attention to variables that are in none of the in-graphs or out- 
graphs found so far (except, possibly, for the root), and we shall add variables to ensure 
that there are n' = n — tn^^^ variables for the tree to grow within. (Recall that for the 
upper bounds we assume that we have a variable for each natural number.) In this way 
the sizes and structures of all the trees will be independent and identically distributed. 
As we alluded to above, later we shall consider a variation on this process. 

Since there are somewhat fewer than n variables in which we explore the out-graphs 
and in-graphs, this decreases the average out-degree of each literal, and has the effect of 
shifting the formula further into the subcritical regime. Specifically, if we define t' by 

1 - 1 - t'in'Y^I^ 



P 



2n 2n' 
we see that t' = {2 + o(l))t. 

Pick a literal u, and look at its trimmed out-graph T within n' unused variables. Recall 
the definition of Rn,p{k), the probability that Cn^2p-p'^iu) is a tree of size k. By using 
Lemma |0| for Rn^p{k), we see that, for some c, there is a {c + o{l)){t /n^^^) chance that T 



is a tree of size between 2n?^^ /t^ and An^/'^ /t^. Here, as explained in the first sentence of 
the proof, we are assuming 1 <^ t ^ n^/^, so by Remark |6.2| , in the remainder of this proof 
all o(l) terms without subscripts are of the form Oj/„i/3_t(l). By comparing with random 
graphs, if T is a tree, then it is uniformly distributed amongst the spanning trees on |T| 
vertices. Using the structural properties of random spanning trees (see e.g. [|Ald90||), if 
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we pick a random vertex w ^ u in T, with probability at least (1 — 0|T|(l))e ^ the path 
connecting u to w has length at least ^y2\T\. Let v be the middle vertex in the path 
from u to w (in case of tie, we choose the vertex closer to w). Either the majority of 
the remaining vertices are connected to the first part of the path or to the second part of 
the path. Since the spanning tree is uniformly random, by symmetry, with probability at 
least 1/2, at least half of the remaining vertices of the tree will be connected to v via the 
path from v to w — these vertices will be in the out-graph of v. In the event that the 
path from u to v has length at least (|T|/2)-'^/^ > n^^^ /t, and v has at least ||T| > r?!"^ jt^ 



descendents in the trimmed out-graph of m, say that vertex v is 
the path from m to f is the tail. We thus have shown 



'promising," and that 



P 



a random vertex v in D\ 



\u) IS promismg 



t 



2enV3- 



In the event that vertex v is promising, we proceed to explore the trimmed in-graphs 
of the first v}!'^ jt vertices on f's tail. Again by Lemma |0| , each individual in-graph will 
have size at least n^/^/t^ with probability at least (c-|-o(l))(t/n^/^). The probability that 
none of them is so large is at most (1 + o(l))e~'^. If any of them is so large, we shall call 
V a "central variable," and v together with its explored out-graph and in-graph both of 
size at least n^f^/t^ an "hourglass" (see Definition |4.2| ). Each time that we pick a literal 
and look for an hourglass as described above, we find one with probabihty at least 



cil 



t 



2e 

Next let us compute how many variables we expect to use up while exploring the 
trimmed out-graph and trimmed in-graphs. At this point we recall from Lemma |6.3| and 
equation (|6.6|) that if G is either the trimmed in-graph or trimmed out-graph of a vertex. 



E[\G\] = {1 + o{l)) 



[n 



/\l/3 



:i+o(i)) 



n 



1/3 



We always explore one out-graph, and with probability (1 + o(l))[c/ {2e)]t/n^/^ we explore 
\n^^^/t \ in-graphs. Thus the expected number of variables used up is 

^1/3 



1 + 



2eJ 2t 



If we look for an hourglass for 



4e 



c(l-e- 



-n 



1/3 



times, then the probability that we fail to find an hourglass is (1 + o(l))e ^, and the 
expected number of variables that we use up is 



:i+o(i)) 



'2e 



+ 1 



n 



2/3 



t2 
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The probability that we use up more than 3(1 + o(l)) times the expected number of 
variables is at most 1/3 + o(l). Therefore, with probability at least 1 — — 1/3 — o(l) > 
1/2 (for large enough t and small enough tn~^/^) we both find an hourglass, and do not 
use up more than 6n^/^/t^ variables, where 6 = 3[1 + 2e/c]/(l — e^'^). 

Now consider the following modification of the above procedure: as above we use the 
local search procedure in Section 2 to explore the trimmed out-graphs and trimmed in- 
graphs, hoping to find an hourglass, but as soon as we use up bn^^^/t"^ variables, we abort 
and stop looking for the hourglass. Then we can repeat this procedure t^/b times, be 
guaranteed to use no variables other than the first n of them, and find a number of disjoint 
hourglasses that stochastically dominates the binomial distribution Binomial(t^/6, 1/2). 
By Chernoff's inequality ||Che52|| (see also ||McD89|| ), the probability that we find fewer 



than half as many hourglasses as we expect will be no larger than exp[— 1^/(86)]. 

Proof of Theorem \4.3{ (ii) on the existence of a giant hourglass in the supercritical regime. 

Let t be a large positive number, but still small compared to n^^^. When p = {1 — 
tn-^/^)/2n, as we have just seen, there will be Q{t^) hourglasses with in- and out-portion of 
size at least n^^'^/t^, except with probability exp(— B(t'^)). We now increase p by a suitably 
large constant times tn~^/'^, say MtrT^^^ . For any two hourglasses, the probability of an 
edge from the out-portion of the first hourglass to the in-portion of the second hourglass is 
therefore at least M/t^. Then the central variable of the first hourglass implies the central 
variable of the second hourglass. Conceptually we can think of the directed graph whose 
nodes are the hourglasses, and place a directed edge from one node to another whenever 
the hourglasses connect up like this. The edges of this graph occur independently of one 
another, and the average out-degree of the graph is Q{Mt^ /t^) = 0(M). By choosing M 
large enough, we can make the average out-degree to be any convenient constant that we 
like. In particular, if the average out-degree is a constant larger than 1, then we might 
expect the connections to percolate, so that there is some node v that can reach a constant 
fraction of the other nodes through edges of this graph, and is reachable by a constant 
fraction of the other nodes. Provided this happens, each literal in the out-portions of 
the nodes reachable by v is implied by the central variable of node v, and each literal in 
the in-portions of the nodes that can reach v will imply the central variable of v, thereby 
giving the desired giant hourglass with in-portion and out-portion each of size Q{tn'^^^). 

It remains to be shown (in Lemma that we get the requisite percolation except 
with probability that is exponentially small in the number of nodes of the graph. —>■ 

The following lemma is related to one proved by Karp in |[Kar9CI|| which showed that 
with high probability there is a giant component of size G(iV) in supercritical directed 
percolation. For our purposes "with high probability" is not sufficient; we need the 
exceptional events to be exponentially rare. 

Lemma 9.1. In a random directed graph on N vertices, in which each directed edge occurs 
independently with probability 6/N, then except with probability that is exponentially small 
in Q{N), there is a vertex v with out-graph of size Q{N) and in-graph of size Q{N). 
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Proof. For convenience, let A^' = [A^/3], so that there are at least 3A^' — 2 vertices, and 
the probability of each directed edge is at least 2/N'. (We can throw out some of the 
edges to make the probability exactly 2/N'.) To search for a node v with a large in-graph 
and out-graph, we consider candidate vertices one at a time, and explore the in-graph 
and out-graph of the candidate, restricting the explorations of the in-graph and out-graph 
to disjoint sets of A^' — 1 vertices, none of which have yet been explored in the course of 
examining a previous failed candidate. In this way we ensure that the sizes of the in-graph 
and out-graph are independent, and both distributed in the same manner as the size of 
the component containing a particular vertex in the random graph Gn'^2/n'- We do the 
explorations in a parallel interleaved fashion, so that if either the in-graph or out-graph is 
found to be too small, then exploration of the other is immediately halted. We shall show 
that for large enough N', except with probability exponentially small in A^', after looking 
at N' /50 candidates, the failed candidates have not wasted more than A^' variables, and 
we find a successful candidate with in-graph and out-graph each of size at least N' /lO. 

We first claim that for any real 7 and integers k and A^' such that < 7 < A^' and 
1 < k < N', there is an s between 1/2 and 1 so that 

//V' — 1 \ / -V \ k-l / ^ \ kN'-sk'^ 

P(|C(x)| = A:inG^,,/^,)=^'"\^_ J(^) - ^) • O-l) 

Indeed, the probability that the component C{x) containing the vertex x has size k can 
be bounded below by the probability that C(x) is a tree of size k. To bound P(|C(x)| = k) 
from above, we note that the connectedness of C{x) implies that C{x) contains a tree of 
size k. Summing over all possibilities for this spanning tree, we therefore get 

J2 ^{T = C{x)) <F{\C{x)\ = k) < J2 ^{T ^C{x),\T\ = \C{x)\). 

trees T ^ x trees T B x 

\T\=k \T\=k 

This give a lower bound of 

uk-2lN' -A f 7 ^KN'-k)+{{';)-{k^i)) 

" ^ k-l [n^J V~N' 



and an upper bound of 



k-l \N'J V N- 



which establishes the claim (|9.1| ). 
Next define X by 



X 



\C{x)\ if |C(a;)| < A^'/IO 
if |C(x)| > AT'/IO 
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Using the bound Q on P(|C(x)| = k) we see that 

kN'-k' 



< :L_/-ie"'=7+7'=V^'. (9.2) 



For k < N'/IO, we get 



p(|C(x)| = A;) < ' ie"(i-7+iog7+7/io)_ 
^ ^ /c! 7 



Assuming 7 = 2, this gives 

H\Cix)\ = k)<^^^le-'/'' (9.3) 

so that 

fc=i fc=i fc=i 

Consequently 

^[e^/'°] < ^ + P(|C(x)| > N'/IO) X e°/^° < 3/2. 

Let Yi be the number of variables lost on the ith candidate if it is a failure; = if 
the ith candidate is successful: 

ith candidate successful 

Yi = 2k — 1 ith. candidate failed because in-graph had size k < N'/IO 
2k ith. candidate failed because out-graph had size k < N'/IO. 

Thus P(F = j) < ¥{X = \j/2]), and hence 

E[e^»/20] = ^P(F, = j)ei/20 < 2 P(X = j/2)e^^^' = 2E[e^/i°] < 3. 

j j even 

Letting 

I3N' 
i=l 
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be the total number of variables lost on failed candidates, we see 

P(S>iV')=P(e^/'°>e^'/20) 



< 



gA"/20 
giV720 



which is exponentially small in A^' for (3 = 1/50. 

Next consider P(|C(a;)| > N'/IO). By and Q we have 

P(l^(^)l>Y^ =1- E H\C{x)\=k)- Yl H\C{x)\=k) 

^ ^ fc=l fc=101ogAf' 

lOlogAT' AT'/IO 



>l-exp[2001og^iVViV'] E^^'"'^""'- E ^f-^""'^'° 

A:=l ■ fe=101ogAr' 

OO 7 jfc— 1 1 

> 1 - exp[200 log' N'/N'] ^ ^^'''^(^'^^ " ^ 



A;! ' 2N' 

k=l 

1 -exp[2001og'A^7A^'](l -^9(7 - 1)) - ^ 



2N' 

= ^9(7 - 1) - 0(log' A^ViV'). 

Thus with probability (1 — o{l))i){'~f — 1)' > 0.63 (for large enough A^'), both the in-graph 
and out-graph have size at least N' /lO. 

Therefore, if we try iV'/50 candidates, the probability that we lose too many variables 
on failed candidates, or have enough variables but still fail to find a vertex with in-graph 
and out-graph of size at least iV'/lO, is bounded by 

qAf750 

I n 37^750 

which establishes the lemma. □ 



Appendix A. Relation between Fn^p and F^^^ 

While the literature focuses on where a given number of clauses are specified, most 
of our theorems and proofs are done for Fn^p, where each clause has some independent 
chance of appearing in the formula, and the total number of clauses is random. In this 
appendix, we discuss the relation between the models Fn^m and With respect to 

monotone properties, these two models are practically interchangeable, provided m is 
about 4(2)^, the expected number of clauses in Fn.p. Write N for the number of 2-clauses 



m=0 



N 
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on xi, . . . ,Xn, SO that N = 2n{n — 1) , and let Mn,p be a binomial random variable with 
parameters and p. Then we have 

N 

P(SAT(F„,p)) = 5^P(M^,p = m)P(SAT(F„,™)) , 

m=0 

N 

E{S{F„,,p)) = 5^P(M^,p = m)E(S(F„,„)) , 

and 

E{S\Fn,p)) = $^P(M^,p = m)E{S\Fn,m)). 

m=0 

Since P(SAT(F„ ,„)) is a monotone decreasing function of m, for every < m < we 
have 

P(SAT(F„,„)) - P(Mjv,p >m)< P(SAT(F„,p)) < P(SAT(F„,„)) + P(Mjv,p < m). 
Similarly, we have 

P(UNSAT(F„,^)) - P(Ma.,p < m) 

< P(UNSAT(F„,p)) 

< P(UNSAT(F„,^)) + F{Mn,p >m) , 

E{S{Fn,m)) - 2nF{MN,p < m) < E{S{Fn,p)) < E{S{Fn,m)) + 2nF{MN,p >m) , 
and 

E{S\F„.,m)) - 4n2p(M;v,p <m)< E{S\F^^p)) < E{S\Fn,m)) + 4n2p(M;v,p >m) . 

In bounding the probability in the tail of the binomial distribution, we shall make use 
of the following Chernoff type inequality (see e.g. [[McD89|| ): 

P(|Mjv,p - pN\ > ppN) < e-p'P^/\ (A.l) 

provided < p < 1/2. 

To bound the probability of unsatisfiability in the subcritical regime, the expected size 
of the spine, and the second moment of the size of the spine, we set 

1 + Xn-^/'^ 

and 

m = (1 + A'n"^/^)n, 

with 
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Then the probabihty that M^^p is too large or too small (compared with m) is 

exp(-e(r2i/^)) = 0(1/^2). 

From this we see that our bounds for Fn,p imply the desired bounds for F„ ^ in the 
subcritical regime. 

To convert the bounds on the probability of satisfiability on the right from Fn,p to 
Fn,mi because this probability is so small, in order for it to dwarf P(M7v,p < m) and 
P(Mjv,p > m), we need to have a larger gap between A and A'. We set 

^' = ^±;^- 

Then the probability of a large deviation is at most 

exp[-e(Ai2/5^i/^)] = o(exp[-0(A3)]), 

if A is small enough compared to in}/^. Furthermore, A'/A is arbitrarily close to 1 provided 
A is sufficiently small compared to n^^^. Thus our bounds for satisfiability of F„ p in the 
supercritical regime carry over to corresponding bounds for Fn^m- 

Appendix B. Proof of Lemma |5T5| 

In this appendix, we prove Lemma |5]^. We start with the proof of statement (i). To this 
end, we again use that the cluster size distribution in Gn,p, p = — p"^, is stochastically 
dominated by a Poisson birth process with parameter 

k = n\ogl/{l-p) = 2nlogl/(l -p) = 2np{l + 0{l/n)). (B.l) 

Writing this parameter as k = 1 + e, and observing that the probability that a Poisson 
birth tree with parameter k has size k is {1 / k){k^~^ / k\){ke~'^)^ , we therefore get 

J2 QnAk)<m + l E 

fc>n2/3/A„ fc>n2/3/A„ 



where 



A:=0 



is the survival probability in a Poisson birth process with parameter k. If £ < ^o, then 

fte^-^ < e-^' (B.4) 

for some constant c = c{eo), so that 

E QnAk) < m + 0( J-— e~^'A < mil + 0{e-^'-)) (B.5) 



fc>n2/3/A„ 
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where we have used that = 6(f) = 6(A„r7, ^^^) in the last step. Since e = e + 
0{n~^) = e{l + 0(A~^)), we conclude that 

QnAk)<m{l + 0{X-')) (B.6) 

A:>n2/3/A„ 

= Q{Xnn-'/') . (B.7) 

To prove a lower bound, we show that 

J2 QnAk)<l-mi^ + 0{X-')). (B.8) 

Together with (ii), which will be proved below, this gives the desired bound. To prove 
( P.8|) , we will first show that for k < ra^/^/A^, 



1 + e' k\ 

where e' is defined as the positive solution of 



1 + 



n 



1 + e 



1 + e e-"e' 



(B.9) 
B.IO) 



Indeed, using ( p.l3|) , we bound 

1 fn 



n\k^ 
{2pnkY~^ 



k-l 



k\ 



n 1 



i=0 



\k~l 



(2pnky 



k{k-l)/2n^-p{2nk-k'^~3k+2) 



fB.ir 



For k < n'^^'^/Xn, we have S2p-p2{k) = 1 + O^k^^"^ /n). Combined with the observation 
that pk"^ — fc^/2n = ek'^/2n < ke"^ /2X^ we therefore get 



\fc-i 



k\ 

1 fc'^-i 



'l + e)e 



1 + 

(1+£-£2A-2/2) 



f.3/2 



n 

k r 

1 + of^ 

V n 



l + e k\ 

Using the definition ( [B.10| ) of e' and observing that e' < e, we get ( [B.9| 
As a consequence of ( [B.9| ), we now have 

^3/2 



:b.i2) 



1 + 



fc<n2/3/A„ 



fc<n2/3/A„ 



n 



<i-.(.') + o( E VT:bV«^ + ^>"^"^')^ 



(B.13) 
(B.14) 



A;<n2/3/A„ 
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Observing that e < Eq implies e' < sq, which imphes a bound of the form (|B.4D for 
k' = 1 + e', we now bound the sum over k as follows: 

fc<n2/3/A„ fc<n2/3/A„ ^ 

= 0(-) y e-'' = 0(- 

fc<n2/3/A„ 

= 0(^) =^(e)0(A;3). (B.15) 

Together with the observation that e' = 6{1 — 0(A~^)) for e small enough, the bounds 
( [BliD and ( [BlsD imply (TO- 

Next we prove statement (ii). To this end, we use a refinement (due to Alon and 
Spencer [ |AS92|| ) of the viewpoint employed by Karp ||Kar90|| and used here in the proof 
of Lemma Define Nq = n — 1, and for positive t, Nt = Binomial (A^^t-i, 1 — p). Then 
let Yt = n — t — Nt, and define T to be the least t such that Yt = 0. This random variable 
T has the same distribution as the size of the connected component containing a given 



vertex in Gn,p [|AS9 2 



Condition the Nt process to be small enough that Yt is positive whenever t < ti^^^/X 
(i.e. T > n^/^/A). How does this affect the distribution of Nt for larger values of t? We 
can think of the NtS as being determined by a collection of i.i.d. 0-1 random variables 
with probability 1 — p. Each A^^ is monotone increasing in these variables. By FKG, the 
above conditioning can only decrease the distribution of Nt (increase the distribution of 
Yt) for any given value of t, and in particular makes it less likely that Yt < for some t 
in a given range. Thus we have 

¥(n^/yX <T< n^/^A) = P^n^/^/A < T)p(3t: Yt < 0, n^/'/X < t < Xn^/'\n^/yX < t) 

= 0(^-^)p(^3t: Yt < 0, n^/yX < t < An^/^j by (gj^ 

= O (-^) [P (y„./3/A < O) + P (3t : = 0, n^/yX < t < An^/^j ] , 

(B.16) 

where in the last line we used Ff+i >Yt — 1. 

Let Xt denote the event that Yt = 0. Let Zt denote the event that Yt = and Ys > 
for r?!"^ jX < s < t. Let 

S= E Ix„ 

n2/3 /A<t< An2/3+n2/3/A2 

where as before Ia denotes the indicator of the event A. We have 

s> y^ iz. 5^ ix,,. , 

n2/3/A<t<An2/3 0<A<n2/3/A2 
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SO that 

E[s]> ^[^*]^ E 

n2/3/A<t<An2/3 [o<A<n2/VA2 

P[ZJ Yl ^{Yt+A = 0\Yt = 0) (B.17) 

n2/3/A<t<An2/3 0<A<n2/3/A2 

> Y nZ,]mm Y ^{Yt+A = 0\Yt = 0) 

n2/3/A<t<An2/3 0<A<n2/3/A2 

> P[3t : Ft = 0] min ^ p(y,^^ = 0|Ft = O) , (B.18) 

0<A<n2/3/A2 

where in ( p.l7| ) we used the fact that the Y^s are Markovian, and in the last hne the 
range of t is given by n^/^/A <t< Xn^/^. 

Next we estimate PfFt+A = ^\Yt = 0] and F[Yt = 0\Yo = 1]. Since 

Ys+A = Ys-A + Binomial(r2 - s - F,, 1 - (1 - p)^), 

we seek 

Binomial(m, r) = i?[Binomial(m, r)] + x ) 

where m = n ~ s — Ys, r = 1 — [1 — p)'^, and x = A — Yg — mr. 

We have m = (l + o(l))n and r = Ap(l + 0(Ap^) = 0(Ap). Thus i?[Binomial(m, r)] = 
mr = 0(A) and Var[Binomial(m, r)] = 0(A). We approximate x by 

X = A-Ys- {n - s - Ys){Ap + 0{AY)) 
= -Ys + A(l -pn + ps+pYs + 0{Anp^)). 

Now assuming F^ = or 1, and s = 0(An^/^), and writing p as (1 + £)/n, we further 
approximate 

x = -Ys + A{-e + 0{Xn-^/^) + 0{l/n) + 0{Xn-^/^)) 
= -F, + AOie) 
= AO{e) ifF, = 0. 

The normal approximation to the binomial is valid when 

X ^ Var[Binomial(m, r)]^''^; 



see Feller [ IbelGSj , Volume I, chapter VII. 3]. In particular, if x = 0(Var[Binomial(m, r)] 



l/2^ 



0(A^/^), which happens when A = 0(l/£:^) = 0(n^/^/A^), we have 
P[Binomial(m, r) = £'[Binomial(m, r)] + a;] = 0(Var[Binomial(m, r)]^"*^/^) = 0(A^^^^) 
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which imphes 

p(n+A = o|n = 0) = e(A-^/2) 

and hence 

J2 P(Fi+A = 0|Ft = 0) = e(Ai/2) ^ =Q{n'/yx). (B.19) 

Next we estimate £'[5']. If s = and t = A, we bound x by 

x = t-l-{n- 1)(1 - (1 - < t - n + n(l -p^)* < t - ntp + nQ^' 

< t - nt{2p - p^) + 2nt{t - l)p^ < t - nt2p + 2ntV < + + £^)V(2^) 

< -te/9, 

provided t < {\ + 1/X^)n^^^ < n and e < 1/3. The deviations x from the mean are too 
large for the normal approximation to be valid. In this case one would typically use the 
Chernoff bound, but we also need the 1 / VVar term not found in the standard Chernoff 
bound. Thus we note that the following variation of the bound is easily deduced from the 
derivation given by Feller of the normal approximation to the binomial distribution. 

p[F, = 0] = O(4e-®(^'/*)) 



= 0(-^e-®(*"')) recalling |x| > te/9 
= 0( / e-QW) since t>n2/VA 
= 0(n-i/3e-®W). 

(B.20) 

Now summing over t, we get 

E[S]= P[rt = 0] = 0(ni/3e-©W). (B.21) 

n2/3/A<i<An2/3+n2/3/A2 

Combining the bounds ( pl8|) , (|Rl9D , and ( [RlTD , we see that 

¥[Yt = for some t such that n^/^/A < t < Xn^^^] = 0(exp(-e(A))). (B.22) 
We also need 

nYnVyx < 0] < exp(-e(A)), (B.23) 

which follows from the straight Chernoff bound. Substituting (|B.22|) and (|B.23|) into 
( P.16|) we obtain the desired inequality 

¥[n^/yX < |C(0)| < Xn^/''] = 0{n-^/'' exp{-e{X))) . 
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